ABSTRACT: 

Two new isolates of the Hepatitis C virus (HCV), J1 and J7, are disclosed. These new isolates 
comprise nucleotide and amino acid sequences which are distinct from the prototype HCV 
isolate, HCV1. Thus, J1 and J7 provide new polynucleotides and polypeptides for use, inter alia, 
in diagnostics, recombinant protein production and vaccine development. 

NEW HCV ISOLATES 



Technical Field 

The present invention relates to new isolates of the viral class Hepatitis C, polypeptides, 
polynucleotides and antibodies derived therefrom, as well as the use of such polypeptides, polynucleotides and 
antibodies in assays (e.g., immunoassays, nucleic acid hybridization assays, etc.) and in the production of 
viral polypeptides. 



Background 

Non-A, Non-B hepatitis (NANBH) is a transmissible disease or family of diseases that are believed to be 
viral-induced, and that are distinguishable from other forms of viral-associated liver diseases, including those 
caused by the known hepatitis viruses, i.e., hepatitis A virus (HAV), hepatitis B virus (HBV), and delta 
hepatitis virus (HDV), as well as the hepatitis induced by cytomegalovirus (CMV) or Epstein-Barr virus 
(EBV). NANBH was first identified in transfused individuals. Transmission from man to chimpanzee and 
serial passage in chimpanzees provided evidence that NANBH is due to a transmissible infectious agent or 
agents. Epidemiologic evidence suggests that there may be three types of NANBH: the water-borne 
epidemic type; the blood or needle associated type; and the sporadically occurring (community acquired) 
type. However, until recently, no transmissible agent responsible for NANBH had been identified. 

Clinical diagnosis and identification of NANBH has been accomplished primarily by exclusion of other 
viral markers. Among the methods used to detect putative NANBH antigens and antibodies are agar-gel 
diffusion, counterimmunoelectrophoresis, immunofluorescence microscopy, immune electron microscopy, 
radioimmunoassay, and enzyme-linked immunosorbent assay. However, none of these assays has proved 
to be sufficiently sensitive, specific, and reproducible to be used as a diagnostic test for NANBH. 

Until recently there has been neither clarity nor agreement as to the identity or specificity of the 
antigen-antibody systems associated with agents of NANBH. It is possible that NANBH is caused by more than one 
infectious agent, and it is unclear what the serological assays detect in the serum of patients with NANBH. 

In the past, a number of candidate NANBH agents were postulated. See, e.g., Prince (1983) Ann. Rev. 
Microbiol. 37:217; Feinstone & Hoofnagle (1984) New Eng. J. Med. 311:185; Overby (1985) Curr. Hepatol. 
5:49; Overby (1986) Curr. Hepatol. 6:65; Overby (1987) Curr. Hepatol. 7:35; and Iwarson (1987) British Med. J. 
295:946. However, there is no proof that any of these candidates represents the etiological agent of NANBH. 

In 1987, Houghton et al. cloned the first virus definitively linked to NANBH. See, e.g., EPO Pub. No. 
318,216; Houghton et al., Science 244:359 (1989). Houghton et al. described therein the cloning of an 
isolate from a new viral class, hepatitis C virus (HCV), the prototype isolate described therein being named 
"HCV1". HCV is a Flavi-like virus, with an RNA genome. Houghton et al. described the production of 
recombinant proteins from HCV sequences that are useful as diagnostic reagents, as well as 
polynucleotides useful in diagnostic hybridization assays and in the cloning of additional HCV isolates. 

The demand for sensitive, specific methods for screening and identifying carriers of NANBH and 
NANBH-contaminated blood or blood products is significant. Post-transfusion hepatitis (PTH) occurs in 
approximately 10% of transfused patients, and NANBH accounts for up to 90% of these cases. There is a 
frequent progression to chronic liver damage (25-55%). 

Patient care, as well as the prevention of transmission of NANBH by blood and blood products or by 
close personal contact, requires reliable diagnostic and prognostic tools to detect nucleic acids, antigens and 
antibodies related to NANBH. In addition, there is a need for effective vaccines and immunotherapeutic 
agents for the prevention and/or treatment of the disease. 

While at least one HCV isolate has been identified which is useful in meeting the above needs, 
additional isolates, particularly those with a divergent genome, may prove to have unique applications. 



Summary of the Invention 

New isolates of HCV have been characterized from Japanese blood donors who have been implicated as 
NANBH carriers. These isolates exhibit nucleotide and amino acid sequence heterogeneity with respect to 
the prototype isolate, HCV1, in several viral domains. It is believed that these distinct sequences are of 
importance, particularly in diagnostic assays and in vaccine development. 

In one embodiment, the present invention provides a DNA molecule comprising a nucleotide sequence 
of at least 15 bp from an HCV isolate substantially homologous to an isolate selected from the group J1 and 
J7, wherein said nucleotide sequence is distinct from the nucleotide sequence of HCV isolate HCV1. 

In another embodiment, the present invention provides a DNA molecule comprising a nucleotide 
sequence of at least 15 bp encoding an amino acid sequence from an HCV isolate J1 or J7 wherein the J1 
or J7 amino acid sequence is distinct from the amino acid sequence of HCV isolate HCV1. 

Yet another embodiment of the present invention provides a purified polypeptide comprising an amino 
acid sequence from an HCV isolate substantially homologous to an isolate selected from the group J1 and 
J7, wherein said amino acid sequence is distinct from the sequence of the polypeptides encoded by the 
HCV isolate HCV1. 

Still another embodiment of the present invention provides a polypeptide comprising an amino acid 
sequence from an HCV isolate J1 or J7 wherein the J1 or J7 amino acid sequence is distinct from the amino 
acid sequence of HCV isolate HCV1 and the polypeptide is immobilized on a solid support. 

In a further embodiment of the present invention, an immunoassay for detecting the presence of 
anti-HCV antibodies in a test sample is provided comprising: (a) incubating the test sample under conditions 
that allow the formation of antigen-antibody complexes with an immunogenic polypeptide comprising an 
amino acid sequence from an HCV isolate substantially homologous to an isolate selected from the group 
J1 and J7, wherein the amino acid sequence is distinct from the amino acid sequence of HCV isolate 
HCV1; and (b) detecting an antigen-antibody complex comprising the immunogenic polypeptide. 

The present invention also provides a composition comprising anti-HCV antibodies that bind an HCV 
epitope substantially free of antibodies that do not bind an HCV epitope, wherein: (a) the HCV epitope 
comprises an amino acid sequence from an HCV isolate substantially homologous to an isolate selected 
from the group J1 and J7, wherein the amino acid sequence is distinct from the amino acid sequence of 
HCV isolate HCV1; and (b) the J1 or J7 amino acid sequence is not immunologically cross-reactive with 
HCV1. 

A further embodiment of the present invention provides an immunoassay for detecting the presence of 
an HCV polypeptide in a test sample comprising: (a) incubating the test sample under conditions that allow 
the formation of antigen-antibody complexes with anti-HCV antibodies that bind an HCV epitope wherein: (i) 
the HCV epitope comprises an amino acid sequence from an HCV isolate J1 or J7; (ii) the J1 or J7 amino 
acid sequence is distinct from the amino acid sequence of HCV isolate HCV1; and (iii) the J1 or J7 amino 
acid sequence is not immunologically cross-reactive with HCV1; and (b) detecting an antigen-antibody 
complex comprising the anti-HCV antibodies. 

Also provided by the present invention is a method of producing anti-HCV antibodies comprising 
administering to a mammal a polypeptide comprising an amino acid sequence from an HCV isolate J1 or J7 
wherein the J1 or J7 amino acid sequence is distinct from the amino acid sequence of HCV isolate HCV1, 
whereby the mammal produces anti-HCV antibodies. 

Yet another embodiment of the present invention provides a method of detecting HCV polynucleotides 
in a test sample comprising: (a) providing a probe comprising the DNA molecule of claim [...]; (b) contacting 
the test sample and the probe under conditions that allow for the formation of a polynucleotide duplex 
between the probe and its complement in the absence of substantial polynucleotide duplex formation 
between the probe and non-HCV polynucleotide sequences present in the test sample; and (c) detecting 
any polynucleotide duplexes comprising the probe. 

A still further embodiment of the present invention provides a method of producing a recombinant 
polypeptide comprising an HCV amino acid sequence, the method comprising: (a) providing host cells 
transformed by a DNA construct comprising control sequences for the host cell operably linked to a 
coding sequence encoding an amino acid sequence from an HCV isolate J1 or J7 wherein the J1 or J7 
amino acid sequence is distinct from the amino acid sequence of HCV isolate HCV1; (b) growing the host 
cells under conditions whereby the coding sequence is transcribed and translated into the recombinant 
polypeptide; and (c) recovering the recombinant polypeptide. 

These and other embodiments of the present invention will be readily apparent to those of ordinary skill 
in the art in view of the following description. 



Brief Description of the Figures 



Figure 1 shows the consensus sequence of the coding strand of a fragment from the J7 C/E domain with 
the heterogeneities. 

Figure 2 shows the consensus sequence of the coding strand of a fragment from the J1 E domain with 
the heterogeneities. 

Figure 3 shows the consensus sequence of the coding strand of a fragment of the J1 E/NS1 domain with 
the heterogeneities. 

Figure 4 shows the consensus sequence of the coding strand of a fragment from the J1 NS3 domain 
with the heterogeneities. 

Figure 5 shows the consensus sequence of the coding strand of a fragment from the J1 NS5 domain 
with the heterogeneities. 

Figure 6 shows the homology of the J7 C/E consensus sequence with the nucleotide sequence of the 
same domain from HCV1. 

Figure 7 shows the homology of the J1 E consensus sequence with the nucleotide sequence of the 
same domain from HCV1. 

Figure 8 shows the homology of the J1 E/NS1 consensus sequence with the nucleotide sequence of the 
same domain from HCV1. 

Figure 9 shows the homology of the J1 NS3 consensus sequence with the nucleotide sequence of the 
same domain from HCV1. 

Figure 10 shows the homology of the J1 NS5 consensus sequence with the nucleotide sequence of the 
same domain from HCV1. 

Figure 11 shows the putative genomic organization of the HCV1 genome. 

Figure 12 shows the nucleotide sequence of the ORF of HCV1. In the figure nucleotide number 1 is the 
first A of the putative initiating methionine of the large ORF; nucleotides upstream of this nucleotide are 
numbered with negative numbers. 

Figure 13 shows the consensus sequence of the coding strand of a fragment from the J1 NS1 domain 
(J1 1519) with the nucleotide sequence of the same domain from HCV1. Also shown are the amino acids 
encoded therein. 

Figure 14 shows a composite of the consensus sequence from the core to the NS1 domain of J1 with 
the nucleotide sequence of the same domain from HCV1. Also shown are the amino acids encoded 
therein. 

Figure 15 shows a consensus sequence of the coding strand of the NS1 domain of J1, as determined in 
Example IV. Also shown are the nucleotide sequence of the same domain from HCV1, and the amino 
acids encoded in the HCV1 and J1 sequences. 

Figure 16 shows a consensus sequence of a coding strand of the C200 region of the NS3-NS4 domain 
of J1. Also shown are the nucleotide sequence of the same domain from HCV1, and the 
amino acids encoded in the sequences. 

Figure 17 shows a consensus sequence of the coding strand of the NS1 domain of J1, as determined in 
Example V. Also shown are the nucleotide sequence of the same domain from HCV1, and the amino 
acids encoded in the sequences. 

Figure 18 shows a consensus sequence of the coding strand of the untranslated and core domains of J1. 
Also shown are the nucleotide sequence of the same domain from HCV1, and the amino acids encoded 
in the sequences. 



40 



Detailed Description of the Invention 



The practice of the present invention will employ, unless otherwise indicated, conventional techniques 
of molecular biology, microbiology, recombinant DNA techniques, and immunology, which are within the 
skill of the art. Such techniques are explained fully in the literature. See e.g., Maniatis, Fritsch & Sambrook, 
MOLECULAR CLONING: A LABORATORY MANUAL (1982); DNA CLONING, VOLUMES I AND II (D.N. 
Glover ed. 1985); OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed. 1984); NUCLEIC ACID HYBRIDIZATION 
(B.D. Hames & S.J. Higgins eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins 
eds. 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL 
Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (1984); the series, METHODS 
IN ENZYMOLOGY (Academic Press, Inc.); GENE TRANSFER VECTORS FOR MAMMALIAN CELLS 
(J.H. Miller and M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Methods in Enzymology Vol. 154 
and Vol. 155 (Wu and Grossman, and Wu, eds., respectively); Mayer and Walker, eds. (1987), 
IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press, London); Scopes 
(1987), PROTEIN PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition (Springer-Verlag, N.Y.); 
and HANDBOOK OF EXPERIMENTAL IMMUNOLOGY, VOLUMES I-IV (D.M. Weir and C.C. Blackwell eds. 
1986). All patents, patent applications, and other publications mentioned herein, both supra and infra, are 
hereby incorporated herein by reference. 

The term "hepatitis C virus" has been reserved by workers in the field for an heretofore unknown 
[...] NANB hepatitis [...] herein, denotes a viral species of which pathogenic [...] 

HCV is a Flavi-like virus. The morphology and composition of Flavivirus particles are known and are 
[...] TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. [...] 
Therefore, there are multiple strains, which may be virulent or avirulent [...] 

It is believed that the genome of HCV isolates is comprised of a single ORF of approximately 9,000 
nucleotides to approximately 12,000 nucleotides, encoding [...] 
encoded polyprotein of similar hydrophobic and antigenic character to that of HCV1, and the presence of 
co-linear peptide sequences that are conserved with HCV1. In addition, the genome is believed to be a 
[...] 

[...] comprise epitopes that are immunologically cross-reactive with epitopes in the HCV1 
genome. At least some of these are epitopes unique to HCV when compared to other Flaviviruses. 
The uniqueness of the epitope may be determined by its immunological reactivity [...] 

[...] are fully homologous. The correspondence between the sequence from a new 
isolate and the HCV1 sequence can be determined by techniques known in the art. For example, they can 
be determined by a direct comparison of the sequence information of the polynucleotide from the new 
isolate and HCV1 sequences. Alternatively, homology can be determined by hybridization of the 
polynucleotides under conditions which form stable duplexes between homologous regions (for example, those 
which would be used prior to S1 digestion), followed by digestion with single-stranded specific nuclease(s), 
followed by size determination of the digested fragments. 
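
The direct comparison of sequence information described above can be sketched in a few lines of Python. This is only an illustrative sketch, not a method disclosed herein: it assumes the two sequences have already been aligned to equal length, the function names `percent_identity` and `mismatch_positions` are hypothetical, and the two short sequences are toy placeholders rather than J1, J7, or HCV1 data.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions at which two equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def mismatch_positions(seq_a: str, seq_b: str) -> list:
    """1-based positions at which the two aligned sequences differ."""
    pairs = zip(seq_a.upper(), seq_b.upper())
    return [i + 1 for i, (a, b) in enumerate(pairs) if a != b]


# Toy placeholder sequences (assumption: not taken from any figure herein).
new_isolate = "ATGAGCACGAATCCTAAACC"
prototype = "ATGAGCACAAATCCTAAACC"

print(percent_identity(new_isolate, prototype))
print(mismatch_positions(new_isolate, prototype))
```

A position-by-position count of this kind presupposes an alignment without gaps; sequences containing insertions or deletions would first need to be aligned by a dedicated alignment tool.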

Because of the evolutionary relationship of the strains or isolates of HCV, putative HCV strains or 
isolates are identifiable by their homology at the polypeptide level. Thus, new HCV isolates are expected to 
be more than about [...] homologous, probably more than about 70% homologous, and even more 
probably more than about 80% homologous, and possibly even more than about 90% homologous at the 
polypeptide level. The techniques for determining amino acid sequence homology are known in the art. For 
example, the amino acid sequence may be determined directly and compared to the sequences provided 
herein. Alternatively, the nucleotide sequence of the genomic material of the putative HCV may be 
determined, the amino acid sequence encoded therein can be determined, and the corresponding regions 
compared. 
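
The alternative route described above, in which the nucleotide sequence is determined, the encoded amino acid sequence is derived, and the corresponding regions are compared, might be sketched as follows. This is a minimal sketch under stated assumptions: it uses the standard genetic code, reads the sequence in the first frame only, and the names `translate` and `amino_acid_identity` as well as the toy sequences are hypothetical and are not taken from the disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nucleotides: str) -> str:
    """Translate a coding-strand sequence in frame 1, ignoring a trailing partial codon."""
    seq = nucleotides.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def amino_acid_identity(nt_a: str, nt_b: str) -> float:
    """Percent identity of the peptides encoded by two equal-length coding sequences."""
    pep_a, pep_b = translate(nt_a), translate(nt_b)
    if len(pep_a) != len(pep_b) or not pep_a:
        raise ValueError("encoded peptides must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(pep_a, pep_b))
    return 100.0 * matches / len(pep_a)


# Toy placeholder sequences (assumption: illustrative only).
reference = "ATGAGCACGAATCCTAAA"
silent_variant = "ATGAGCACAAATCCTAAA"    # ACG -> ACA, both encode threonine
coding_variant = "ATGAGCACGAATCGTAAA"    # CCT -> CGT, proline -> arginine

print(translate(reference))
print(amino_acid_identity(reference, silent_variant))
print(round(amino_acid_identity(reference, coding_variant), 1))
```

The silent variant shows why homology at the polypeptide level can exceed homology at the nucleotide level: a nucleotide difference need not change the encoded amino acid.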

The ORF of HCV1 is shown in Figure 12. The non-structural, core, and envelope domains of the 
polyprotein have been predicted for HCV1 (Figure 5). The "C", or core, polypeptide [...] 
encoded from the 5' terminus to about nucleotide 345 of HCV1. The putative [...] of 
HCV1 is believed to be encoded from about nucleotide 346 to about nucleotide [...] 
non-structural one domain, is thought to be encoded from [...] 
3013 to about nucleotide 4950, putative NS4 from 
about nucleotide 4951 to about nucleotide 6297, and putative NS5 from about nucleotide 6298 to the 3' 
terminus, respectively. The above boundaries are approximations based on an analysis of the ORF. The 
exact boundaries can be determined by those skilled in the art in view of the disclosure herein. 

"HCV/J1" or "J1" and "HCV/J7" or "J7" refer to new HCV isolates characterized by the nucleotide 
sequence disclosed herein, as well as related isolates that are substantially homologous thereto; i.e., at least 
about 90% or about 95% at the nucleotide level. It is believed that the sequences disclosed herein 
characterize an HCV subclass that is predominant in Japan and other Asian and/or Pacific rim countries. 
Additional J1 and J7 isolates can be obtained in view of the disclosure herein and EPO Pub. No. 318,216. In 
particular, the J1 and J7 nucleotide sequences disclosed herein, as well as the HCV1 sequences in Figure 
12, can be used as primers or probes to clone additional domains of J1, J7, or additional isolates. 

As used herein, a nucleotide sequence "from" a designated sequence or source refers to a nucleotide 
sequence that is homologous (i.e., identical) to or complementary to the designated sequence or source, or 
a portion thereof. The J1 sequences provided herein are a minimum of about 6 nucleotides, preferably 
about 8 nucleotides, more preferably about 15 nucleotides, and most preferably 20 nucleotides or longer. 
The maximum length is the complete viral genome. 

In some aspects of the invention, the sequence of the region from which the polynucleotide is derived 
is preferably homologous to or complementary to a sequence which is unique to an HCV genome or the J1 
and J7 genome. Whether or not a sequence is unique to a genome can be determined by techniques 
known to those of skill in the art. For example, the sequence can be compared to sequences in databanks, 
e.g., Genebank, to determine whether it is present in the uninfected host or other organisms. The sequence 
can also be compared to the known sequences of other viral agents, including those which are known to 
induce hepatitis, e.g., HAV, HBV, and HDV, and to other members of the Flaviviridae. The correspondence 
or non-correspondence of the derived sequence to other sequences can also be determined by 
hybridization under the appropriate stringency conditions. Hybridization techniques for determining the 
complementarity of nucleic acid sequences are known in the art. See also, for example, Maniatis et al. (1982) 
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor Press, Cold Spring Harbor, N.Y.). 
In addition, mismatches of duplex polynucleotides formed by hybridization can be determined by known 
techniques, including for example, digestion with a nuclease such as S1 that specifically digests 
single-stranded areas in duplex polynucleotides. Regions from which typical DNA sequences may be derived 
include, but are not limited to, regions encoding specific epitopes, as well as non-transcribed and/or 
non-translated regions. 
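
The databank comparison described above can be illustrated with a short Python sketch that lists the fragments of a candidate sequence that do not occur, on either strand, in any of a set of reference sequences. This is an assumption-laden illustration rather than a disclosed procedure: it tests exact matches only, the function names are hypothetical, the default fragment length of 15 simply echoes the 15 bp length mentioned in the Summary, and a real search against a databank would use a dedicated similarity-search tool.

```python
def kmers(sequence: str, k: int) -> set:
    """All contiguous fragments of length k in the sequence."""
    seq = sequence.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def reverse_complement(sequence: str) -> str:
    """Reverse complement of a DNA sequence written with A, C, G and T."""
    return sequence.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def unique_fragments(candidate: str, references: list, k: int = 15) -> list:
    """Fragments of length k in `candidate` absent from every reference on either strand."""
    seen = set()
    for ref in references:
        seen |= kmers(ref, k) | kmers(reverse_complement(ref), k)
    return sorted(kmers(candidate, k) - seen)


# Toy placeholder data with a short fragment length so the result is easy to check by eye.
print(unique_fragments("ACGTTGCA", ["ACGTAAAA"], k=4))
```

Fragments returned by such a check are only candidates for uniqueness; as the text notes, correspondence can also be assessed by hybridization under appropriate stringency conditions.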

The J1 or J7 polynucleotide is not necessarily physically derived from the nucleotide sequence shown, 
but may be generated in any manner, including for example, chemical synthesis or DNA replication or 
reverse transcription or transcription. In addition, combinations of regions corresponding to that of the 
designated sequence may be modified in ways known in the art to be consistent with an intended use. The 
polynucleotides may also include one or more labels, which are known to those of skill in the art. 

An amino acid sequence "from" a designated polypeptide or source of polypeptides means that the 
amino acid sequence is homologous (i.e., identical) to the sequence of the designated polypeptide, or a 
portion thereof. An amino acid sequence "from" a designated nucleic acid sequence refers to a polypeptide 
having an amino acid sequence identical to that of a polypeptide encoded in the sequence, or a portion 
thereof. The J1 or J7 amino acid sequences in the polypeptides of the present invention are at least about 
5 amino acids in length, preferably at least about 10 amino acids, more preferably at least about 15 amino 
acids, and most preferably at least about 20 amino acids. 

The polypeptides of the present invention are not necessarily translated from a designated nucleic acid 
sequence; the polypeptides may be generated in any manner, including for example, chemical synthesis, or 
expression of a recombinant expression system, or isolation from virus. The polypeptides may include one 
or more analogs of amino acids or unnatural amino acids. Methods of inserting analogs of amino acids into 
a sequence are known in the art. The polypeptides may also include one or more labels, which are known 
to those of skill in the art. 

The term "recombinant polynucleotide" as used herein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is linked to a 
polynucleotide other than that to which it is linked in nature, or (2) does not occur in nature. 

The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of the molecule. 
Thus, this term includes double- and single-stranded DNA, and RNA. It also includes known types of 
modifications, for example, labels which are known in the art, methylation, "caps", substitution of one or 
more of the naturally occurring nucleotides with an analog, internucleotide modifications such as, for 
example, those with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, 



EP 0 419 182 A1 



carbamates, etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), those 
containing pendant moieties, such as, for example, proteins (including for e.g., nucleases, toxins, antibodies, 
signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine, psoralen, etc.), those containing 
chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing alkylators, those 
with modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of the 
polynucleotide. 

"Purified polynucleotide" refers to a composition comprising a specified polynucleotide that is 
substantially free of other components, such composition typically comprising at least about 70% of the specified 
polynucleotide, more typically at least about 80%, 90% or even 95% to 99% of the specified 
polynucleotide. 

"Purified polypeptide" refers to a composition comprising a specified polypeptide that is substantially 
free of other components, such composition typically comprising at least about 70% of the specified 
[...] 

[...] parent, due to natural, accidental, or deliberate mutation. 

A "replicon" is any genetic element, e.g., a plasmid, a chromosome, a virus, [...] 
behaves as an autonomous unit of polynucleotide replication within a cell; i.e., capable of replication under 
[...] 

A "cloning vector" is a replicon that can transform a selected host cell and in which [...] 
polynucleotide segment is attached, so as to bring about the replication and/or expression of the attached 
segment. Typically, cloning vectors include plasmids, virus (e.g., bacteriophage vectors) and cosmids. 

An "integrating vector" is a vector that does not behave as a replicon in [...] 
the ability to integrate into a replicon (typically a chromosome) resident in the selected host to stably 
[...] 

An "expression vector" is a construct that can transform a selected host cell and provides for 
expression of a heterologous coding sequence in the selected host. Expression vectors can be either a 
[...] 

A "coding sequence" is a polynucleotide sequence which is transcribed into mRNA and/or translated 
into a polypeptide when placed under the control of appropriate regulatory sequences. The boundaries of 
the coding sequence are determined by a translation start codon at the 5'-terminus and a translation stop 
codon at the 3'-terminus. A coding sequence can include, but is not limited to, mRNA, cDNA, and 
[...] 

[...] regulatory sequences which are necessary to effect 
expression of coding sequences to which they are ligated. The nature of such control sequences differs 
depending upon the host organism. In prokaryotes, control sequences generally include promoter, 
ribosomal binding site, and terminators. In eukaryotes, generally, control sequences include promoters, 
terminators and, in some instances, enhancers. The term "control sequences" is intended to include, at a 
minimum, all components the presence of which are necessary for expression, and may also include 
[...] 

"Operably linked" refers to a juxtaposition wherein the components so described are in a relationship 
permitting them to function in their intended manner. A control sequence "operably linked" to a coding 
sequence is ligated in such a way that expression of the coding sequence is achieved under conditions 
[...] 

An "open reading frame" (ORF) is a region of a polynucleotide sequence which encodes a 
polypeptide; this region may represent a portion of a coding sequence or a total coding sequence. 
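
The definitions above place the boundaries of a coding sequence at a translation start codon and a translation stop codon. As an illustration only, the following Python sketch scans the three forward reading frames of a DNA sequence for regions that begin with ATG and end at the first in-frame stop codon. The function name `find_orfs`, the `min_codons` parameter, and the toy sequence are assumptions introduced for this example; the sketch assumes the standard stop codons and does not examine the complementary strand.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(sequence: str, min_codons: int = 1) -> list:
    """Return (start, end) pairs, 0-based and half-open, for ATG...stop regions.

    Each region starts at an ATG, ends just after the first in-frame stop codon,
    and contains at least `min_codons` codons before the stop.
    """
    seq = sequence.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i                      # first start codon opens a region
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None                   # look for the next start codon
    return sorted(orfs)


# Toy placeholder sequence (assumption: illustrative only).
toy = "CCATGAAATTTTAGGG"
for begin, end in find_orfs(toy):
    print(begin, end, toy[begin:end])
```

A region that reaches the end of the sequence without a stop codon is not reported by this sketch, which matters when only a fragment of a longer coding sequence is available.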
"Immunologically cross-reactive" refers to two or more epitopes or polypeptides that are bound by the 
same antibody. Cross-reactivity can be determined by any of a number of immunoassay techniques, such 
[...] 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides which [...] at 
least one epitope. An "antigen binding site" is formed from the folding [...] 
antibody molecule(s) to form three-dimensional binding sites with an internal surface shape and charge 
distribution complementary to the features of an epitope of an antigen, which allows specific binding to form 
an antigen-antibody complex. An antigen binding site may be formed from a heavy- and/or light-chain 
domain (VH and VL, respectively), which form hypervariable loops which contribute to antigen binding. The 
term "antibody" includes, without limitation, chimeric antibodies, altered antibodies, univalent antibodies, Fab 
proteins, and single-domain antibodies. In many cases, the binding phenomena of antibodies to antigens is 
equivalent to other ligand/anti-ligand binding. 

As used herein, a "single domain antibody" (dAb) is an antibody which is comprised of a VH domain, 
which binds specifically with a designated antigen. A dAb does not contain a VL domain, but may contain 
other antigen binding domains known to exist in antibodies, for example, the kappa and lambda domains. 
Methods for preparing dAbs are known in the art. See, for example, Ward et al., Nature 341:544 (1989). 

Antibodies may also be comprised of VH and VL domains, as well as other known antigen binding 
domains. Examples of these types of antibodies and methods for their preparation are known in the art 
(see, e.g., U.S. Patent No. 4,816,467, which is incorporated herein by reference), and include the following. 
For example, "vertebrate antibodies" refers to antibodies which are tetramers or aggregates thereof, 
comprising light and heavy chains which are usually aggregated in a "Y" configuration and which may or 
may not have covalent linkages between the chains. In vertebrate antibodies, the amino acid sequences of 
the chains are homologous with those sequences found in antibodies produced in vertebrates, whether in 
situ or in vitro (for example, in hybridomas). Vertebrate antibodies include, for example, purified polyclonal 
antibodies and monoclonal antibodies, methods for the preparation of which are described infra. 

"Hybrid antibodies" are antibodies where chains are separately homologous with reference to 
mammalian antibody chains and represent novel assemblies of them, so that two different antigens are precipitable 
by the tetramer or aggregate. In hybrid antibodies, one pair of heavy and light chains are homologous to 
those found in an antibody raised against a first antigen, while a second pair of chains are homologous to 
those found in an antibody raised against a second antigen. This results in the property of "divalence", 
i.e., the ability to bind two antigens simultaneously. Such hybrids may also be formed using chimeric 
chains, as set forth below. 

"Chimeric antibodies" refers to antibodies in which the heavy and/or light chains are fusion proteins. 
Typically, one portion of the amino acid sequences of the chain is homologous to corresponding sequences 
in an antibody derived from a particular species or a particular class, while the remaining segment of the 
chain is homologous to the sequences derived from another species and/or class. Usually, the variable 
region of both light and heavy chains mimics the variable regions of antibodies derived from one species of 
vertebrates, while the constant portions are homologous to the sequences in the antibodies derived from 
another species of vertebrates. However, the definition is not limited to this particular example. Also 
included is any antibody in which either or both of the heavy or light chains are composed of combinations 
of sequences mimicking the sequences in antibodies of different sources, whether these sources be from 
differing classes or different species of origin, and whether or not the fusion point is at the variable/constant 
boundary. Thus, it is possible to produce antibodies in which neither the constant nor the variable region 
mimics known antibody sequences. It then becomes possible, for example, to construct antibodies whose 
variable region has a higher specific affinity for a particular antigen, or whose constant region can elicit 
enhanced complement fixation, or to make other improvements in properties possessed by a particular 
constant region. 

Another example is "altered antibodies", which refers to antibodies in which the naturally occurring 
amino acid sequence in a vertebrate antibody has been varied. Utilizing recombinant DNA techniques, 
antibodies can be redesigned to obtain desired characteristics. The possible variations are many, and range 
from the changing of one or more amino acids to the complete redesign of a region, for example, the 
constant region. Changes in the constant region are made, in general, to attain desired cellular process 
characteristics, e.g., changes in complement fixation, interaction with membranes, and other effector functions. 
Changes in the variable region may be made to alter antigen binding characteristics. The antibody may also 
be engineered to aid the specific delivery of a molecule or substance to a specific cell or tissue site. The 
desired alterations may be made by known techniques in molecular biology, e.g., recombinant techniques, 
site-directed mutagenesis, etc. 

Yet another example is "univalent antibodies", which are aggregates comprised of a heavy-chain/light-chain
dimer bound to the Fc (i.e., stem) region of a second heavy chain. This type of antibody escapes
antigenic modulation. See, e.g., Glennie et al., Nature 295:712 (1982). Included also within the definition of
antibodies are "Fab" fragments of antibodies. The "Fab" region refers to those portions of the heavy and
light chains which are roughly equivalent, or analogous, to the sequences which comprise the branch
portion of the heavy and light chains, and which have been shown to exhibit immunological binding to a
specified antigen, but which lack the effector Fc portion. "Fab" includes aggregates of one heavy and one
light chain (commonly known as Fab), as well as tetramers containing the 2H and 2L chains (referred to as
F(ab)2), which are capable of selectively reacting with a designated antigen or antigen family. Fab
antibodies may be divided into subsets analogous to those described above, i.e., "vertebrate Fab", "hybrid
Fab", "chimeric Fab", and "altered Fab". Methods of producing Fab fragments of antibodies are known
within the art and include, for example, proteolysis, and synthesis by recombinant techniques.

"Epitope" refers to an antibody binding site usually defined by a polypeptide, but also by non-amino
acid haptens. An epitope could comprise 3 amino acids in a spatial conformation which is unique to the
epitope; generally an epitope consists of at least 5 such amino acids, and more usually, consists of at least
8-10 such amino acids.

"Antigen-antibody complex" refers to the complex formed by an antibody that is specifically bound to
an epitope on an antigen.

"Immunogenic polypeptide" refers to a polypeptide that elicits a cellular and/or humoral immune
response in a mammal, whether alone or linked to a carrier, in the presence or absence of an adjuvant.

"Polypeptide" refers to a polymer of amino acids and does not refer to a specific length of the
molecule. Thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. This
term also does not refer to or exclude post-expression modifications of the polypeptide, for example,
glycosylations, acetylations, phosphorylations and the like. Included within the definition are, for example,
polypeptides containing one or more analogs of an amino acid (including, for example, unnatural amino
acids, etc.), polypeptides with substituted linkages, as well as other modifications known in the art, both
naturally occurring and non-naturally occurring.

"Transformation", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell,
irrespective of the method used for the insertion, for example, direct uptake, transduction, f-mating or
electroporation. The exogenous polynucleotide may be maintained as a non-integrated vector, for example,
a plasmid, or alternatively, may be integrated into the host genome.

A "transformed" host cell refers to both the immediate cell that has undergone transformation and its
progeny that maintain the originally exogenous polynucleotide.

"Treatment" as used herein refers to prophylaxis and/or therapy.

"Individual" refers to vertebrates, particularly members of the mammalian species, and includes but is
not limited to domestic animals, sports animals, and primates, including humans.

"Sense strand" refers to the strand of a double-stranded DNA molecule that is homologous to an mRNA
transcript thereof. The "anti-sense strand" contains a sequence which is complementary to that of the
"sense strand".
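
The sense/anti-sense relationship defined above can be illustrated with a short Python sketch. The function names `antisense` and `transcript` and the four-base example are illustrative assumptions and are not taken from the disclosure; the sketch assumes unambiguous DNA letters (A, C, G, T) and writes every strand 5' to 3'.

```python
# Watson-Crick pairing for unambiguous DNA bases (assumption: no IUPAC codes).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense(sense: str) -> str:
    """Return the anti-sense strand (5'->3') for a sense-strand DNA sequence."""
    # Complement every base, then reverse so the result also reads 5'->3'.
    return sense.upper().translate(_COMPLEMENT)[::-1]


def transcript(sense: str) -> str:
    """Return the mRNA sequence homologous to the sense strand (T replaced by U)."""
    return sense.upper().replace("T", "U")


if __name__ == "__main__":
    example = "ATGC"  # hypothetical sequence, for illustration only
    print(antisense(example))
    print(transcript(example))
```

Applying `antisense` twice returns the original sequence, which is a quick sanity check on the pairing table.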

"Antibody-containing body component" refers to a component of an individual's body which is a source
of the antibodies of interest. Antibody-containing body components are known in the art, and include but are
not limited to, whole blood and components thereof, plasma, serum, spinal fluid, lymph fluid, the external
sections of the respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, white blood cells, and
myelomas.

"Purified HCV" isolate refers to a preparation of HCV particles which has been isolated from the cellular
constituents with which the virus is normally associated, and from other types of viruses which may be
present in the infected tissue. The techniques for isolating viruses are known to those of skill in the art, and
include, for example, centrifugation and affinity chromatography.

An HCV "particle" is an entire virion, as well as particles which are intermediates in virion formation.
HCV particles generally have one or more HCV proteins associated with the HCV nucleic acid.

"Probe" refers to a polynucleotide which forms a hybrid structure with a sequence in a target
polynucleotide, due to complementarity of at least one region in the probe with a region in the target.

"Biological sample" refers to a sample of tissue or fluid isolated from an individual, including but not
limited to, for example, whole blood and components thereof, plasma, serum, spinal fluid, lymph fluid, the
external sections of the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells,
tumors, organs, and also samples of in vitro cell culture constituents (including but not limited to
conditioned medium resulting from the growth of cells in cell culture medium, putatively virally infected
cells, recombinant cells, and cell components).

The invention pertains to the isolation and characterization of newly discovered isolates of HCV, J1 and
J7, their nucleotide sequences, their protein sequences and resulting polynucleotides, polypeptides and
antibodies derived therefrom. Isolates J1 and J7 are novel in their nucleotide and amino acid sequences,
and are believed to be characteristic of HCV isolates from Japan and other Asian countries.
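
One simple way to express how far an isolate departs from a prototype sequence is percent identity over two pre-aligned sequences of equal length. The sketch below is illustrative only: the function name `percent_identity` is an assumption, and the short example strings are hypothetical and are not J1, J7 or HCV1 sequences.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions at which two equal-length sequences agree."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions where the residues (or bases) are identical.
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # Hypothetical 8-base fragments differing at one position.
    print(percent_identity("ACGTACGT", "ACGTTCGT"))
```

The same function works for amino acid sequences, since it only compares characters position by position; it does not perform the alignment itself.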

The nucleotide sequences derived from HCV/J1 and HCV/J7 are useful as probes to diagnose the
presence of virus in samples, and to isolate other naturally occurring variants of the virus. These nucleotide
sequences also make available polypeptide sequences of HCV antigens encoded within the J1 and J7
genomes and permit the production of polypeptides which are useful as standards or reagents in diagnostic
tests and/or as components of vaccines. Antibodies, both polyclonal and monoclonal, directed against HCV
epitopes contained within these polypeptide sequences are also useful for diagnostic tests, as therapeutic
agents, for screening of antiviral agents, and for the isolation of the NANBH virus. In addition, by utilizing
probes derived from the sequences disclosed herein it is possible to isolate and sequence other portions of
the J1 and J7 genomes, thus giving rise to additional probes and polypeptides which are useful in the
diagnosis and/or treatment, both prophylactic and therapeutic, of NANBH.

The availability of the HCV/J1 and HCV/J7 nucleotide sequences enables the construction of
polynucleotide probes and polypeptides useful in diagnosing NANBH due to HCV infection and in screening
blood donors as well as donated blood and blood products for infection. For example, from the sequences it
is possible to synthesize DNA oligomers of about 8-10 nucleotides, or larger, which are useful as
hybridization probes to detect the presence of HCV RNA in, for example, sera of subjects suspected of
harboring the virus, or for screening donated blood for the presence of the virus. The HCV/J1 and HCV/J7
sequences also allow the design and production of HCV specific polypeptides which are useful as
diagnostic reagents for the presence of antibodies raised during NANBH. Antibodies to purified
polypeptides derived from the HCV/J1 and HCV/J7 sequences may also be used to detect viral antigens in
infected individuals and in blood.

Knowledge of these HCV/J1 and HCV/J7 sequences also enables the design and production of
polypeptides which may be used as vaccines against HCV and also for the production of antibodies, which
in turn may be used for protection against the disease, and/or for therapy of HCV infected individuals.
Moreover, the disclosed HCV/J1 and HCV/J7 sequences enable further characterization of the HCV
genome. Polynucleotide probes derived from these sequences, as well as from the HCV genome, may be
used to screen cDNA libraries for additional viral cDNA sequences, which, in turn, may be used to obtain
additional overlapping sequences. See, e.g., EPO Pub. No. 318,216.

The HCV/J1 and HCV/J7 polynucleotide sequences, the polypeptides derived therefrom and the
antibodies directed against these polypeptides, are useful in the isolation and identification of the
BB-NANBV agent(s). For example, antibodies directed against HCV epitopes contained in polypeptides
derived from the HCV/J1 sequences may be used in processes based upon affinity chromatography to isolate
the virus. Alternatively, the antibodies may be used to identify viral particles isolated by other techniques.
The viral antigens and the genomic material within the isolated viral particles may then be further
characterized.

The information obtained from further sequencing of the HCV/J1 and HCV/J7 genomes, as well as from
further characterization of the HCV/J1 and HCV/J7 antigens and characterization of the genomes, enables the
design and synthesis of additional probes and polypeptides and antibodies which may be used for
diagnosis, for prevention, and for therapy of HCV induced NANBH, and for screening for infected blood and
blood-related products.

The availability of HCV/J1 and HCV/J7 cDNA sequences permits the construction of expression vectors
encoding antigenically active regions of the polypeptide encoded in either strand. These antigenically active
regions may be derived from coat or envelope antigens or from core antigens, or from antigens which are
non-structural including, for example, polynucleotide binding proteins, polynucleotide polymerase(s), and
other viral proteins required for the replication and/or assembly of the virus particle. Fragments encoding
the desired polypeptides are derived from the cDNA clones using conventional restriction digestion or by
synthetic methods, and are ligated into vectors which may, for example, contain portions of fusion
sequences such as beta-galactosidase or superoxide dismutase (SOD). Methods and vectors which are
useful for the production of polypeptides which contain fusion sequences of SOD are described in EPO
Pub. No. 196,056. Vectors encoding fusion polypeptides of SOD and HCV polypeptides are described in
EPO Pub. No. 318,216. Any desired portion of the HCV cDNA containing an open reading frame, in either
sense strand, can be obtained as a recombinant polypeptide, such as a mature or fusion protein.
Alternatively, a polypeptide encoded in the cDNA can be provided by chemical synthesis.
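
Locating an open reading frame in either strand of a cDNA can be sketched in Python as a scan of all six reading frames for the longest run of codons uninterrupted by a stop codon. This is a minimal sketch under stated assumptions: the function names, the standard stop codons (TAA, TAG, TGA) and the 12-base example are illustrative and are not taken from the disclosed sequences, and no start codon is required.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def longest_stop_free_runs(cdna: str):
    """For each strand and frame, report the longest run of codons with no stop.

    Returns a list of (strand, frame, start, codons) tuples. `start` is the
    0-based offset of the run within the strand being read; for strand "-"
    that strand is the reverse complement of the input.
    """
    results = []
    for strand, seq in (("+", cdna.upper()), ("-", reverse_complement(cdna))):
        for frame in range(3):
            best_start, best_len = frame, 0
            run_start, run_len = frame, 0
            # Step through complete codons only.
            for pos in range(frame, len(seq) - 2, 3):
                if seq[pos:pos + 3] in STOP_CODONS:
                    run_start, run_len = pos + 3, 0  # restart after the stop
                else:
                    run_len += 1
                    if run_len > best_len:
                        best_start, best_len = run_start, run_len
            results.append((strand, frame, best_start, best_len))
    return results


if __name__ == "__main__":
    for row in longest_stop_free_runs("ATGAAATAGCCC"):  # hypothetical input
        print(row)
```

A fragment chosen for expression would then be taken from whichever strand and frame carries the run of interest; the sketch only reports candidates and does not translate them.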

The DNA encoding the desired polypeptide, whether in fused or mature form, and whether or not
containing a signal sequence to permit secretion, may be ligated into expression vectors suitable for any
convenient host. Both eukaryotic and prokaryotic host systems are presently used in forming recombinant
polypeptides, and a summary of some of the more common control systems and host cells is given below.
The polypeptide produced in such host cells is then isolated from lysed cells or from the culture medium
and purified to the extent needed for its intended use. Purification may be by techniques known in the art,
for example, differential extraction, salt fractionation, chromatography on ion exchange resins, affinity
chromatography, centrifugation, and the like. See, for example, Methods in Enzymology for a variety of
methods for purifying proteins.

Such recombinant or synthetic HCV polypeptides can be used as diagnostics, or those which give rise
to neutralizing antibodies may be formulated into vaccines. Antibodies raised against these polypeptides
can also be used as diagnostics, or for passive immunotherapy. In addition, antibodies to these
polypeptides are useful for isolating and identifying HCV particles.

The HCV antigens may also be isolated from HCV virions. The virions may be grown in HCV infected
cells in tissue culture, or in an infected host. a substantially complete viral domain, in

Screening such peptides in an immunoassay is within the skill of the art. It is also known to carry out a
computer analysis of a protein sequence to identify potential epitopes, and then prepare oligopeptides 
comprising the identified regions for screening. It is appreciated by those of skill in the art that such 
computer analysis of antigenicity does not always identify an epitope that actually exists, and can also 
incorrectly identify a region of the protein as containing an epitope. 
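
Preparing oligopeptides for screening can be sketched as tiling a protein sequence with overlapping windows. The sketch below is illustrative: the function name, the default window of 10 residues (chosen from the 8-10 residue epitope size mentioned in the definitions), the step size and the 20-residue example string are assumptions, and the example is not an HCV sequence.

```python
def overlapping_peptides(protein: str, length: int = 10, step: int = 5):
    """Return (offset, peptide) pairs tiling `protein` with overlapping windows.

    `length` is the peptide size in residues and `step` is the shift between
    consecutive peptides; a step smaller than the length gives overlap.
    """
    if length < 1 or step < 1:
        raise ValueError("length and step must be positive")
    protein = protein.upper()
    return [
        (offset, protein[offset:offset + length])
        for offset in range(0, len(protein) - length + 1, step)
    ]


if __name__ == "__main__":
    # Hypothetical 20-residue sequence, tiled with 8-mers shifted by 4.
    for offset, peptide in overlapping_peptides("ACDEFGHIKLMNPQRSTVWY", 8, 4):
        print(offset, peptide)
```

Because computer predictions of antigenicity can miss real epitopes or flag false ones, as noted above, exhaustive tiling of this kind is one way to avoid relying on the prediction alone; each peptide would still need to be tested in an immunoassay.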

The observed relationship of the putative polyproteins of HCV and the Flaviviruses allows a prediction
of the putative domains of the HCV "non-structural" (NS) proteins. The locations of the individual NS
proteins in the putative Flavivirus precursor polyprotein are fairly well-known. Moreover, these also coincide
with observed gross fluctuations in the hydrophobicity profile of the polyprotein. It is established that NS5 of
Flaviviruses encodes the virion polymerase, and that NS1 corresponds with a complement fixation antigen
which has been shown to be an effective vaccine in animals. Recently, it has been shown that a flaviviral
protease function resides in NS3. Due to the observed similarities between HCV and the Flaviviruses,
deductions concerning the approximate locations of the corresponding protein domains and functions in the
HCV polyprotein are possible. Figure 11 is a schematic of putative domains of the HCV polyprotein. The
expression of polypeptides containing these domains in a variety of recombinant host cells, including, for
example, bacteria, yeast, insect, and vertebrate cells, should give rise to important immunological reagents
which can be used for diagnosis, detection, and vaccines.
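
A hydrophobicity profile of the kind referred to above can be sketched as a sliding-window average over a per-residue scale. The sketch below uses the Kyte-Doolittle hydropathy scale as an assumption; the disclosure does not state which scale or window was used, and the function name, window size and example string are illustrative only.

```python
# Kyte-Doolittle hydropathy values (assumption: the scale actually used for
# the profiles discussed in the text is not stated in this section).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9):
    """Return the mean hydropathy of every window of `window` residues."""
    if window < 1:
        raise ValueError("window must be positive")
    scores = [KYTE_DOOLITTLE[residue] for residue in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


if __name__ == "__main__":
    # Hypothetical peptide: a hydrophobic stretch followed by a basic stretch.
    for value in hydropathy_profile("IIIIIRRRRR", window=5):
        print(round(value, 2))
```

Positive values mark hydrophobic stretches and negative values mark hydrophilic ones, so a boundary between domains of different character appears as a sign change or a sharp step in the profile.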

Although the non-structural protein region of the putative polyproteins of the HCV isolate described
herein and of Flaviviruses appears to be generally similar, there is less similarity between the putative
structural regions which are towards the N-terminus. In this region, there is a greater divergence in
sequence, and in addition, the hydrophobic profiles of the two regions show less similarity. This
"divergence" begins in the N-terminal region of the putative NS1 domain in HCV, and extends to the
presumed N-terminus. Nevertheless, it is still possible to predict the approximate locations of the putative
nucleocapsid (N-terminal basic domain) and E (generally hydrophobic) domains within the HCV polyprotein.
From these predictions it may be possible to identify approximate regions of the HCV polyprotein that could
correspond with useful immunological reagents. For example, the E and NS1 proteins of Flaviviruses are
known to have efficacy as protective vaccines. These regions, as well as some which are shown to be
antigenic in HCV1, for example those within putative NS3, C, and NS5, etc., should also provide
diagnostic reagents.

The immunogenicity of the HCV sequences may also be enhanced by preparing the sequences fused
to or assembled with particle-forming proteins such as, for example, hepatitis B surface antigen or rotavirus
VP6 antigen. Constructs wherein the HCV epitope is linked directly to the particle-forming protein coding
sequences produce hybrids which are immunogenic with respect to the HCV epitope. In addition, all of the
vectors prepared include epitopes specific to HBV, having various degrees of immunogenicity, such as, for
example, the pre-S peptide. Thus, particles constructed from particle-forming protein which include HCV
sequences are immunogenic with respect to HCV and particle-forming protein. See, e.g., U.S. Pat. No.
4,722,840; EPO Pub. No. 175,261; EPO Pub. No. 259,149; Michelle et al. (1984) Int. Symposium on Viral
Hepatitis.

Vaccines may be prepared from one or more immunogenic polypeptides derived from HCV/J1 or
HCV/J7. The observed homology between HCV and Flaviviruses provides information concerning the
polypeptides which are likely to be most effective as vaccines, as well as the regions of the genome in
which they are encoded. The general structure of the Flavivirus genome is discussed in Rice et al. (1986) in
THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and Wagner, Vol.
eds. Schlesinger and Schlesinger, Plenum Press). The flavivirus genomic RNA is believed to be the only
virus-specific mRNA species, and it is translated into the three viral structural proteins, i.e., C, M, and E, as
well as two large nonstructural proteins, NV4 and NV5, and a complex set of smaller nonstructural proteins.
It is known that major neutralizing epitopes for Flaviviruses reside in the E (envelope) protein. Roehrig
(1986) in THE VIRUSES: THE TOGAVIRIDAE AND FLAVIVIRIDAE (Series eds. Fraenkel-Conrat and
Wagner, Vol. eds. Schlesinger and Schlesinger, Plenum Press). The corresponding HCV E gene and
polypeptide encoding region may be predicted, based upon the homology to Flaviviruses. Thus, vaccines
may be comprised of recombinant polypeptides containing epitopes of HCV E. These polypeptides may be
expressed in bacteria, yeast, or mammalian cells, or alternatively may be isolated from viral preparations. It
is also anticipated that the other structural proteins may also contain epitopes which give rise to protective
anti-HCV antibodies. Thus, polypeptides containing the epitopes of E, C, and M may also be used, whether
singly or in combination, in HCV vaccines.

In addition to the above, it has been shown that immunization with NS1 (nonstructural protein 1) results
in protection against yellow fever. Schlesinger et al. (1986) J. Virol. 60:1153. This is true even though the
immunization does not give rise to neutralizing antibodies. Thus, particularly since this protein appears to be
highly conserved among Flaviviruses, it is likely that HCV NS1 will also be protective against HCV infection.
Moreover, it also shows that nonstructural proteins may provide protection against viral pathogenicity, even
if they do not cause the production of neutralizing antibodies.

In view of the above, multivalent vaccines against HCV may be comprised of one or more epitopes
from one or more structural proteins, and/or one or more epitopes from one or more nonstructural proteins.
These vaccines may be comprised of, for example, recombinant HCV polypeptides and/or polypeptides
isolated from the virions. In particular, vaccines are contemplated comprising one or more of the following
HCV proteins, or subunit antigens derived therefrom: E, NS1, C, NS2, NS3, NS4 and NS5. Particularly
preferred are vaccines comprising E and/or NS1, or subunits thereof. In addition, it may be possible to use
inactivated HCV in vaccines; inactivation may be by the preparation of viral lysates, or by other means
known in the art to cause inactivation of Flaviviruses, for example, treatment with organic solvents or
detergents, or treatment with formalin. Moreover, vaccines may also be prepared from attenuated HCV
strains or from hybrid viruses such as vaccinia vectors known in the art [Brown et al. Nature 319:549-550
(1986)].
The preparation of vaccines which contain immunogenic polypeptide(s) as active ingredients is known
to one skilled in the art. Typically, such vaccines are prepared as injectables, either as liquid solutions or
suspensions; solid forms suitable for solution in, or suspension in, liquid prior to injection may also be
prepared. The preparation may also be emulsified, or the protein encapsulated in liposomes. The active
immunogenic ingredients are often mixed with excipients which are pharmaceutically acceptable and
compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol,
ethanol, or the like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts
of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which
enhance the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are
not limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP),
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP),
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine
(CGP 19835A, referred to as MTP-PE), and RIBI, which contains three components extracted from bacteria,
monophosphoryl lipid A, trehalose dimycolate and cell wall skeleton (MPL + TDM + CWS) in a 2%
squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by measuring the
amount of antibodies directed against an immunogenic polypeptide containing an HCV antigenic sequence
resulting from administration of this polypeptide in vaccines which are also comprised of the various
adjuvants.

The vaccines are conventionally administered parenterally, by injection, usually either subcutaneously
or intramuscularly. Additional formulations which are suitable for other modes of administration include
suppositories and, in some cases, oral formulations. For suppositories, traditional binders and carriers may
include, for example, polyalkylene glycols or triglycerides; such suppositories may be formed from mixtures
containing the active ingredient in the range of 0.5% to 10%, preferably 1%-2%. Oral formulations include
such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch,
magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. These compositions
take the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or powders
and contain 10%-95% of active ingredient, preferably 25%-70%.

The proteins may be formulated into the vaccine as neutral or salt forms. Pharmaceutically acceptable
salts include the acid addition salts (formed with free amino groups of the peptide) and which are formed
with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as
acetic, oxalic, tartaric, maleic, and the like. Salts formed with the free carboxyl groups may also be derived
from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides,
and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and
the like.

The vaccines are administered in a manner compatible with the dosage formulation, and in such
amount as will be prophylactically and/or therapeutically effective. The quantity to be administered, which is
generally in the range of 5 micrograms to 250 micrograms of antigen per dose, depends on the subject to
be treated, capacity of the subject's immune system to synthesize antibodies, and the degree of protection
desired. Precise amounts of active ingredient required to be administered may depend on the judgment of
the practitioner and may be peculiar to each subject.

The vaccine may be given in a single dose schedule, or preferably in a multiple dose schedule. A
multiple dose schedule is one in which a primary course of vaccination may be with 1-10 separate doses,
followed by other doses given at subsequent time intervals required to maintain and/or reinforce the
immune response, for example, at 1-4 months for a second dose, and if needed, a subsequent dose(s) after
several months. The dosage regimen will also, at least in part, be determined by the need of the individual
and be dependent upon the judgment of the practitioner.

In addition, the vaccine containing the immunogenic HCV antigen(s) may be administered in conjunction
with other immunoregulatory agents, for example, immune globulins.

The immunogenic polypeptides prepared as described above are used to produce antibodies, both
polyclonal and monoclonal. If polyclonal antibodies are desired, a selected mammal (e.g., mouse, rabbit,
goat, horse, etc.) is immunized with an immunogenic polypeptide bearing an HCV epitope(s). Serum from
the immunized animal is collected and treated according to known procedures. If serum containing
polyclonal antibodies to an HCV epitope contains antibodies to other antigens, the polyclonal antibodies can
be purified by immunoaffinity chromatography. Techniques for producing and processing polyclonal
antisera are known in the art, see for example, Mayer and Walker, eds. (1987) IMMUNOCHEMICAL
METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press, London).

Monoclonal antibodies directed against HCV epitopes can also be readily produced by one skilled in
the art. The general methodology for making monoclonal antibodies by hybridomas is well known. Immortal
antibody-producing cell lines can be created by cell fusion, and also by other techniques such as direct
transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein-Barr virus. See, e.g., M.
Schreier et al. (1980) HYBRIDOMA TECHNIQUES; Hammerling et al. (1981) MONOCLONAL ANTIBODIES
AND T-CELL HYBRIDOMAS; Kennett et al. (1980) MONOCLONAL ANTIBODIES; see also U.S. Patent
Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632; and 4,493,890. Panels of
monoclonal antibodies produced against HCV epitopes can be screened for various properties, i.e., for
isotype, epitope affinity, etc.

Antibodies, both monoclonal and polyclonal, which are directed against HCV epitopes are particularly
useful in diagnosis, and those which are neutralizing are useful in passive immunotherapy. Monoclonal
antibodies, in particular, may be used to raise anti-idiotype antibodies.

Anti-idiotype antibodies are immunoglobulins which carry an "internal image" of the antigen of the
infectious agent against which protection is desired. Techniques for raising anti-idiotype antibodies are
known in the art. See, e.g., Grzych (1985), Nature 316:74; MacNamara et al. (1984), Science 226:1325;
Uytdehaag et al. (1985), J. Immunol. 134:1225. These anti-idiotype antibodies may also be useful for
treatment and/or diagnosis of NANBH, as well as for an elucidation of the immunogenic regions of HCV
antigens.

Using the HCV/J1 or HCV/J7 polynucleotide sequences as a basis, oligomers of approximately 8
nucleotides or more can be prepared, either by excision or synthetically, which hybridize with the HCV
genome and are useful in identification of the viral agent(s), further characterization of the viral genome(s),
as well as in detection of the virus(es) in diseased individuals. The probes for HCV polynucleotides (natural
or derived) are a length which allows the detection of unique viral sequences by hybridization. While 6-8
nucleotides may be a workable length, sequences of about 10-12 nucleotides are preferred, and about 20
nucleotides appears optimal. These probes can be prepared using routine methods, including automated
oligonucleotide synthetic methods. Among useful probes, for example, are the clones disclosed herein, as
well as the various oligomers useful in probing cDNA libraries, set forth below. A complement to any unique
portion of the HCV genome will be satisfactory. For use as probes, complete complementarity is desirable,
though it may be unnecessary as the length of the fragment is increased.
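
Enumerating candidate probes of a chosen length can be sketched in Python as taking the reverse complement of every window of a target sequence. The sketch is illustrative: the function names, the default length of 20 (from the length described above as appearing optimal) and the 15-base example are assumptions, the target is written as DNA (for an RNA target its cDNA sequence would be used), and uniqueness of a window within the full genome is not checked.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def candidate_probes(target: str, length: int = 20):
    """Return (offset, probe) pairs, one per window of `length` bases.

    Each probe is fully complementary to the target window starting at
    `offset`, so it can form a hybrid with that region.
    """
    if length < 1:
        raise ValueError("length must be positive")
    target = target.upper()
    return [
        (offset, reverse_complement(target[offset:offset + length]))
        for offset in range(len(target) - length + 1)
    ]


def is_fully_complementary(probe: str, target: str) -> bool:
    """True if the probe pairs perfectly with some region of the target."""
    return reverse_complement(probe) in target.upper()


if __name__ == "__main__":
    target = "ATGGCGTACGTTAGC"  # hypothetical 15-base target
    for offset, probe in candidate_probes(target, length=8):
        print(offset, probe, is_fully_complementary(probe, target))
```

In practice a candidate would also be screened against unrelated sequences, since a short window is more likely to occur by chance elsewhere, which is the reason longer probes are preferred in the text.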

For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum,
may be treated, if desired, to extract the nucleic acids contained therein. The resulting nucleic acid from the
sample may be subjected to gel electrophoresis or other size separation techniques; alternatively, the
nucleic acid sample may be dot blotted without size separation. The probes are then labeled. Suitable
labels, and methods for labeling probes, are known in the art, and include, for example, radioactive labels
incorporated by nick translation or kinasing, biotin, fluorescent probes, and chemiluminescent probes. The
nucleic acids extracted from the sample are then treated with the labeled probe under hybridization
conditions of suitable stringencies. Usually high stringency conditions are desirable in order to prevent false
positives. The stringency of hybridization is determined by a number of factors during hybridization and
during the washing procedure, including temperature, ionic strength, length of time, and concentration of
formamide. These factors are outlined in, for example, Maniatis, T. (1982) MOLECULAR CLONING: A
LABORATORY MANUAL (Cold Spring Harbor Press, Cold Spring Harbor, N.Y.).
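
The way ionic strength and formamide shift hybrid stability can be illustrated with a commonly cited empirical approximation for the melting temperature of long DNA:DNA hybrids, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%GC) - 0.61(%formamide) - 500/L. This formula is not part of the disclosure and is introduced here only as an assumption for illustration; it is intended for hybrids longer than roughly 50 base pairs, and RNA:DNA hybrids and short oligomers behave differently. The function and parameter names are likewise illustrative.

```python
import math


def estimated_tm_celsius(
    gc_percent: float,
    length_nt: int,
    na_molar: float = 0.3,
    formamide_percent: float = 0.0,
) -> float:
    """Approximate melting temperature (deg C) of a long DNA:DNA hybrid.

    Empirical approximation only (see the hedges in the accompanying text).
    """
    if length_nt <= 0 or na_molar <= 0:
        raise ValueError("length_nt and na_molar must be positive")
    return (
        81.5
        + 16.6 * math.log10(na_molar)   # higher salt stabilizes the hybrid
        + 0.41 * gc_percent             # G-C pairs raise the melting point
        - 0.61 * formamide_percent      # formamide destabilizes the hybrid
        - 500.0 / length_nt             # shorter hybrids melt lower
    )


if __name__ == "__main__":
    base = estimated_tm_celsius(50.0, 500, na_molar=1.0)
    with_formamide = estimated_tm_celsius(50.0, 500, na_molar=1.0,
                                          formamide_percent=50.0)
    print(round(base, 1), round(with_formamide, 1))
```

Hybridizing or washing closer to the estimated melting temperature corresponds to higher stringency, which is why raising the temperature, lowering the salt concentration or adding formamide all reduce false positives.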

Generally, it is expected that the HCV genome sequences will be present in serum of infected
individuals at relatively low levels, i.e., at approximately 10^2-10^3 chimp infectious doses (CID) per ml. This
level may require that amplification techniques be used in hybridization assays. Such techniques are known
in the art. For example, the Enzo Biochemical Corporation "Bio-Bridge" system uses terminal
deoxynucleotide transferase to add unmodified 3'-poly-dT-tails to a DNA probe. The poly dT-tailed probe is
hybridized to the target nucleotide sequence, and then to a biotin-modified poly-A. PCT App. No. 84/03520
and EPO Pub. No. 124,221 describe a DNA hybridization assay in which: (1) analyte is annealed to a
single-stranded DNA probe that is complementary to an enzyme-labeled oligonucleotide; and (2) the
resulting tailed duplex is hybridized to an enzyme-labeled oligonucleotide. EPO Pub. No. 204,510 describes
a DNA hybridization assay in which analyte DNA is contacted with a probe that has a tail, such as a poly-dT
tail, an amplifier strand that has a sequence that hybridizes to the tail of the probe, such as a poly-A
sequence, and which is capable of binding a plurality of labeled strands.

A particularly desirable technique may first involve amplification of the target HCV sequences in sera 
approximately 10,000-fold, i.e., to approximately 10^6 sequences/ml. This may be accomplished, for 
example, by the polymerase chain reaction (PCR) technique described by Saiki et al. (1986) 
Nature 324:163, Mullis, U.S. Patent No. 4,683,195, and Mullis et al., U.S. Patent No. 4,683,202. The 
amplified sequence(s) may then be detected using a hybridization assay which is described in co-pending 
European Publication No. 317,077 and Japanese application No. 63-260347, which are assigned to the 
herein assignee, and are hereby incorporated herein by reference. These hybridization assays, which 
should detect sequences at the level of 10^6/ml, utilize nucleic acid multimers which bind to single-stranded 
analyte nucleic acid, and which also bind to a multiplicity of single-stranded labeled oligonucleotides. A 
suitable solution phase sandwich assay which may be used with labeled polynucleotide probes, and the 
methods for the preparation of probes, is described in EPO Pub. No. 225,807, which is hereby incorporated 
herein by reference. 
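
As a rough illustration of the roughly 10,000-fold amplification discussed above, the following Python sketch estimates how many PCR cycles such a gain would take. It assumes an idealized model in which every cycle multiplies the copy number by (1 + efficiency); the function name and the efficiency parameter are hypothetical and are not taken from the cited references.

```python
def cycles_for_fold(fold, efficiency=1.0):
    """Smallest number of cycles giving at least `fold` amplification.

    Idealized model (an assumption): each cycle multiplies the number of
    copies by (1 + efficiency), where efficiency = 1.0 means perfect doubling.
    """
    if fold < 1:
        raise ValueError("fold must be at least 1")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in the interval (0, 1]")
    cycles = 0
    gain = 1.0
    while gain < fold:
        gain *= 1 + efficiency  # one more round of primer extension
        cycles += 1
    return cycles


if __name__ == "__main__":
    print(cycles_for_fold(10_000))       # perfect doubling every cycle
    print(cycles_for_fold(10_000, 0.8))  # 80% efficiency per cycle
```

Under perfect doubling, 14 cycles suffice (2^14 = 16,384); real reactions are less efficient, so more cycles are normally run.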

The probes can be packaged into diagnostic kits. Diagnostic kits include the probe DNA, which may be 
labeled; alternatively, the probe DNA may be unlabeled and the ingredients for labeling may be included in 
the kit in separate containers. The kit may also contain other suitably packaged reagents and materials 
needed for the particular hybridization protocol, for example, standards, wash buffers, as well as instructions 
for conducting the test. 

Both the HCV/J1 or HCV/J7 polypeptides which react immunologically with serum containing HCV 
antibodies and the antibodies raised against the HCV specific epitopes in these polypeptides are useful in 
immunoassays to detect presence of HCV antibodies, or the presence of the virus and/or viral antigens, in 
biological samples. Design of the immunoassays is subject to a great deal of variation, and a variety of 
these are known in the art. An immunoassay for anti-HCV antibody may utilize one viral epitope or several 
viral epitopes. When multiple epitopes are used, the epitopes may be derived from the same or different 
viral polypeptides, and may be in separate recombinant or natural polypeptides, or together in the same 
recombinant polypeptides. 

An immunoassay for viral antigen may use, for example, a monoclonal antibody directed towards a viral 
epitope, a combination of monoclonal antibodies directed towards epitopes of one viral polypeptide, 
monoclonal antibodies directed towards epitopes of different viral polypeptides, polyclonal antibodies 
directed towards the same viral antigen, polyclonal antibodies directed towards different viral antigens, or a 
combination of monoclonal and polyclonal antibodies. 

Immunoassay protocols may be based, for example, upon competition, or direct reaction, or sandwich 
type assays. Protocols may also, for example, use solid supports, or may be by immunoprecipitation. Most 
assays involve the use of labeled antibody or polypeptide. The labels may be, for example, fluorescent, 
chemiluminescent, radioactive, or dye molecules. Assays which amplify the signals from the probe are also 
known. Examples are assays which utilize biotin and avidin, and enzyme-labeled and mediated 
immunoassays, such as ELISA assays. 

Typically, an immunoassay for anti-HCV antibody will involve selecting and preparing the test sample, 
such as a biological sample, and then incubating it with an antigenic (i.e., epitope-containing) HCV 
polypeptide under conditions that allow antigen-antibody complexes to form. Such conditions are well 
known in the art. In a heterogeneous format, the polypeptide is bound to a solid support to facilitate 
separation of the sample from the polypeptide after incubation. Examples of solid supports that can be used 
are nitrocellulose, in membrane or microtiter well form; polyvinylchloride, in sheets or microtiter wells; 
polystyrene latex, in beads or microtiter plates; polyvinylidine fluoride, known as Immobulon™; diazotized 
paper; nylon membranes; activated beads; and Protein A beads. Most preferably, the Dynatech Immulon™ 
1 microtiter plate or the 0.25-inch polystyrene beads, which are Spec finished by Precision Plastic Ball, are 
used in the heterogeneous format. The solid support is typically washed after separating it from the test 
sample. In a homogeneous format, the test sample is incubated with antigen in solution, under conditions 
that will precipitate any antigen-antibody complexes that are formed, as is known in the art. The precipitated 
complexes are then separated from the test sample, for example, by centrifugation. The complexes formed 
comprising anti-HCV antibody are then detected by any of a number of techniques. Depending on the 
format, the complexes can be detected with labeled anti-xenogeneic Ig or, if a competitive format is used, 
by measuring the amount of bound, labeled competing antibody. 

In immunoassays where HCV polypeptides are the analyte, the test sample, typically a biological 
sample, is incubated with anti-HCV antibodies, again under conditions that allow the formation of 
antigen-antibody complexes. Various formats can be employed, such as a "sandwich" assay where antibody bound 
to a solid support is incubated with the test sample; washed; incubated with a second, labeled antibody to 
the analyte; and the support is washed again. Analyte is detected by determining if the second antibody is 
bound to the support. In a competitive format, which can be either heterogeneous or homogeneous, a test 
sample is usually incubated with an antibody and a labeled, competing antigen either sequentially or 
simultaneously. These and other formats are well known in the art. 

The Flavivirus model for HCV allows predictions regarding the likely location of diagnostic epitopes for 
the virion structural proteins. The C, pre-M, M, and E domains are all likely to contain epitopes of significant 
potential for detecting viral antigens, and particularly for diagnosis. Similarly, domains of the nonstructural 
proteins are expected to contain important diagnostic epitopes (e.g., NS5 encoding a putative polymerase; 
and NS1 encoding a putative complement-binding antigen). Recombinant polypeptides, or viral polypeptides, 
which include epitopes from these specific domains may be useful for the detection of viral antibodies 
in infectious blood donors and infected patients. In addition, antibodies directed against the E and/or M 
proteins can be used in immunoassays for the detection of viral antigens in patients with HCV-caused 
NANBH, and in infectious blood donors. Moreover, these antibodies may be extremely useful in detecting 
acute-phase donors and patients. 

Antigenic regions of the putative polyprotein can be mapped and identified by screening the antigenicity 
of bacterial expression products of HCV cDNAs which encode portions of the polyprotein. Other 
antigenic regions of HCV may be detected by expressing the portions of the HCV cDNAs in other 
expression systems, including yeast systems and cellular systems derived from insects and vertebrates. In 
addition, studies giving rise to an antigenicity index and hydrophobicity/hydrophilicity profile yield 
information concerning the probability of a region's antigenicity. Efficient detection systems may include the 
use of panels of epitopes. The epitopes in the panel may be constructed into one or multiple polypeptides. 
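
The hydrophobicity/hydrophilicity profile mentioned above can be illustrated with a short sliding-window calculation. The Python sketch below is only an assumption about how such a profile might be computed: the text does not name a scale, so the widely used Kyte-Doolittle hydropathy values are used here, and the function name, window size and example peptide are hypothetical.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
# The choice of this scale is an assumption; the text does not specify one.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=7):
    """Mean hydropathy of every full window along a one-letter protein sequence."""
    protein = protein.upper()
    if window < 1 or window > len(protein):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in protein]
    return [
        sum(scores[start:start + window]) / window
        for start in range(len(scores) - window + 1)
    ]


if __name__ == "__main__":
    # Arbitrary illustrative peptide, not an HCV sequence.
    for position, value in enumerate(hydropathy_profile("MKTLLDEERRKSSGAVILV", 7), 1):
        print(position, round(value, 2))
```

Strongly negative (hydrophilic) windows are the ones more often exposed on a protein surface, which is why such profiles are used as a first, rough hint of where antigenic regions may lie.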

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the polypeptides of the invention containing HCV epitopes or 
antibodies directed against HCV epitopes in suitable containers, along with the remaining reagents and 
materials required for the conduct of the assay (e.g., wash buffers, detection means like labeled anti-human 
Ig, labeled anti-HCV, or labeled HCV antigen), as well as a suitable set of assay instructions. 

The HCV/J1 and HCV/J7 nucleotide sequence information described herein may be used to gain further 
information on the sequence of the HCV genomes, and for identification and isolation of additional HCV 
isolates related to J1 or J7. This information, in turn, can lead to additional polynucleotide probes, 
polypeptides derived from the HCV genome, and antibodies directed against HCV epitopes which would be 
useful for the diagnosis and/or treatment of HCV-caused NANBH. 

The HCV/J1 and HCV/J7 nucleotide sequence information herein is useful for the design of probes for 
the isolation of additional sequences which are derived from as yet undefined regions of the HCV genomes 
from which the J1 and J7 sequences are derived. For example, labeled probes containing a sequence of 
approximately 8 or more nucleotides, and preferably 20 or more nucleotides, which are derived from 
regions close to the 5'-termini or 3'-termini of the family of HCV cDNA sequences disclosed in the 
examples may be used to isolate overlapping cDNA sequences from HCV cDNA libraries. These sequences 
which overlap the cDNAs in the above-mentioned clones, but which also contain sequences derived from 
regions of the genome from which the cDNA in the above-mentioned clones are not derived, may then be 
used to synthesize probes for identification of other overlapping fragments which do not necessarily overlap 
the cDNAs described below. Methods for constructing cDNA libraries are known in the art. See, e.g., EPO 
Pub. No. 318,216. It is particularly preferred to prepare libraries from the serum of Japanese and other 
Asian patients diagnosed as having NANBH demonstrating antibody to HCV1 antigens; these are believed 
to be the most likely candidates for carriers of HCV/J1, HCV/J7, or related isolates. 

HCV particles may be isolated from the sera from individuals with NANBH or from cell cultures by any 
of the methods known in the art, including, for example, techniques based on size discrimination such as 
sedimentation or exclusion methods, or techniques based on density such as ultracentrifugation in density 
gradients, or precipitation with agents such as polyethylene glycol, or chromatography on a variety of 
materials such as anionic or cationic exchange materials, and materials which bind due to hydrophobicity. 
A preferred method of isolating HCV particles or antigen is by immunoaffinity columns. Techniques for 
immunoaffinity chromatography are known in the art, including techniques for affixing antibodies to solid 
supports so that they retain their immunoselective activity. The techniques may be those in which the 
antibodies are adsorbed to the support (see, for example, Kurstak in ENZYME IMMUNODIAGNOSIS, pages 
31-37) as well as those in which the antibodies are covalently linked to the support. Generally, the 
[...] remains accessible. The antibodies may be monoclonal or polyclonal, and it may be 
desirable to purify the antibodies before their use in the immunoassay. 

The general techniques used in extracting the genome from a virus, preparing and probing a cDNA 
[...] 

[...] sequences conferring antibiotic resistance markers. These markers may be [...] 
transformants by selection. Commonly used [...] (penicillinase) and lactose [...], the [...] (trp) 
promoter system (Goeddel et al. (1980) [...]), the lambda-derived P[...] promoter 
and N gene ribosome binding site (Shimatake et al. (1981) Nature [...]), and the hybrid tac promoter 
([...] Proc. Natl. Acad. Sci. USA 292:128) derived from sequences of the trp and lac [...] 

[...] glycolytic enzymes (Hess et al. [...]), including the promoter for 3-phosphoglycerate kinase 
(Hitzeman (1980) [...]) [...] 

[...] The vaccinia virus [...] thymidine kinase gene of the vaccinia virus [...] heterologous [...] 

2. IRL Press]. 

In addition, viral antigens can be expressed in insect cells by the Baculovirus system. A general guide 
to baculovirus expression by Summers and Smith is A Manual of Methods for Baculovirus Vectors and Insect 
Cell Culture Procedures (Texas Agricultural Experiment Station Bulletin No. 1555). To incorporate the 
heterologous gene into the Baculovirus genome, the gene is first cloned into a transfer vector containing 
some Baculovirus sequences. This transfer vector, when it is cotransfected with wild-type virus into insect 
cells, will recombine with the wild-type virus. Usually, the transfer vector will be engineered so that the 
heterologous gene will disrupt the wild-type Baculovirus polyhedron gene. This disruption enables easy 
selection of the recombinant virus since the cells infected with the recombinant virus will appear 
phenotypically different from the cells infected with the wild-type virus. The purified recombinant virus can 
be used to infect cells to express the heterologous gene. The foreign protein can be secreted into the 
medium if a signal peptide is linked in frame to the heterologous gene; otherwise, the protein will be bound 
in the cell lysates. For further information, see Smith et al., Mol. & Cell. Biol. 3:2156-2165 (1983) or Luckow 
and Summers in Virology 17:31-39 (1989). 

Transformation may be by any known method for introducing polynucleotides into a host cell, including, 
for example, packaging the polynucleotide in a virus and transducing a host cell with the virus, and by direct 
uptake of the polynucleotide. The transformation procedure used depends upon the host to be transformed. 
Bacterial transformation by direct uptake generally employs treatment with calcium or rubidium chloride 
(Cohen (1972) Proc. Natl. Acad. Sci. USA 69:2110; Maniatis et al. (1982), MOLECULAR CLONING: A 
LABORATORY MANUAL (Cold Spring Harbor Press, Cold Spring Harbor, N.Y.)). Yeast transformation by 
direct uptake may be carried out using the method of Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 
75:1929. Mammalian transformations by direct uptake may be conducted using the calcium phosphate 
precipitation method of Graham and Van der Eb (1978), Virology 52:546, or the various known modifications 
thereof. 

Vector construction employs techniques which are known in the art. Site-specific DNA cleavage is 
performed by treating with suitable restriction enzymes under conditions which generally are specified by 
the manufacturer of these commercially available enzymes. The cleaved fragments may be separated using 
polyacrylamide or agarose gel electrophoresis techniques, according to the general procedures found in 
Methods in Enzymology (1980) 65:499-580. Sticky ended cleavage fragments may be blunt ended using E. 
coli DNA polymerase I (Klenow) in the presence of the appropriate deoxynucleotide triphosphates (dNTPs) 
present in the mixture. Treatment with S1 nuclease may also be used, resulting in the hydrolysis of any 
single stranded DNA portions. 

Ligations are carried out using standard buffer and temperature conditions using T4 DNA ligase and 
ATP; sticky end ligations require less ATP and less ligase than blunt end ligations. When vector fragments 
are used as part of a ligation mixture, the vector fragment is often treated with bacterial alkaline 
phosphatase (BAP) or calf intestinal alkaline phosphatase to remove the 5'-phosphate and thus prevent 
religation of the vector; alternatively, restriction enzyme digestion of unwanted fragments can be used to 
prevent ligation. Ligation mixtures are transformed into suitable cloning hosts, such as E. coli, and 
successful transformants selected by, for example, antibiotic resistance, and screened for the correct 
construction. 

Synthetic oligonucleotides may be prepared using an automated oligonucleotide synthesizer as described 
by Warner (1984), DNA 3:401. If desired, the synthetic strands may be labeled with 32P by 
treatment with polynucleotide kinase in the presence of 32P-ATP, using standard conditions for the reaction. 
DNA sequences, including those isolated from cDNA libraries, may be modified by known techniques, 
including, for example, site directed mutagenesis, as described by Zoller (1982), Nucleic Acids Res. 10:[...]. 

DNA libraries may be probed using the procedure of Grunstein and Hogness (1975), Proc. Natl. Acad. 
Sci. USA 73:3961. Briefly, in this procedure, the DNA to be probed is immobilized on nitrocellulose filters, 
denatured, and prehybridized with a buffer. The percentage of formamide in the buffer, as well as the time 
and temperature conditions of the prehybridization and subsequent hybridization steps, depends on the 
stringency required. Oligomeric probes which require lower stringency conditions are generally used with 
low percentages of formamide, lower temperatures, and longer hybridization times. Probes containing more 
than 30 or 40 nucleotides, such as those derived from cDNA or genomic sequences, generally employ 
higher temperatures, e.g., about 40-42°C, and a high percentage, e.g., 50%, formamide. Following 
prehybridization, 5'-32P-labeled oligonucleotide probe is added to the buffer, and the filters are incubated in 
this mixture under hybridization conditions. After washing, the treated filters are subjected to autoradiography 
to show the location of the hybridized probe; DNA in corresponding locations on the original agar 
plates is used as the source of the desired DNA. 
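
To make the link between formamide, temperature and stringency more concrete, the Python sketch below evaluates a commonly quoted empirical approximation for the melting temperature (Tm) of long DNA:DNA hybrids. This formula is not taken from the cited procedure; it is a textbook rule of thumb, its coefficients (in particular the formamide term) vary between sources, and the function and parameter names are hypothetical.

```python
import math


def dna_duplex_tm(gc_percent, length_nt, na_molar=1.0, formamide_percent=0.0):
    """Approximate Tm in degrees Celsius for a long DNA:DNA hybrid.

    Empirical rule of thumb (an assumption, not part of the source text):
    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.61*(%formamide) - 500/length
    """
    if length_nt <= 0 or na_molar <= 0:
        raise ValueError("length_nt and na_molar must be positive")
    return (
        81.5
        + 16.6 * math.log10(na_molar)
        + 0.41 * gc_percent
        - 0.61 * formamide_percent
        - 500.0 / length_nt
    )


if __name__ == "__main__":
    # Hypothetical 500-nt probe with 50% G+C in 1 M Na+.
    print(dna_duplex_tm(50, 500, 1.0, 0.0))   # no formamide
    print(dna_duplex_tm(50, 500, 1.0, 50.0))  # 50% formamide
```

In this approximation, 50% formamide lowers the estimated Tm by about 30°C, which is consistent with stringent hybridization of long probes being run at about 40-42°C rather than at much higher temperatures.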

An enzyme-linked immunosorbent assay (ELISA) can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of an enzyme to either an antigen or an antibody, 
and uses the bound enzyme activity as a quantitative label. To measure antibody, the known antigen is 
fixed to a solid phase (e.g., a microplate or plastic cup), incubated with test serum dilutions, washed, 
incubated with anti-immunoglobulin labeled with an enzyme, and washed again. Enzymes suitable for 
labeling are known in the art, and include, for example, horseradish peroxidase. Enzyme activity bound to 
the solid phase is measured by adding the specific substrate, and determining product formation or 
substrate utilization colorimetrically. The enzyme activity bound is a direct function of the amount of 
antibody bound. 

To measure antigen, a known specific antibody is fixed to the solid phase, the test material containing 
antigen is added, after an incubation the solid phase is washed, and a second enzyme-labeled antibody is 
added. After washing, substrate is added, and enzyme activity is estimated colorimetrically, and related to 
antigen concentration. 



Examples 



This example describes the cloning of the HCV/J1 and HCV/J7 nucleotide sequences. 

Both blood samples which were used as a source of HCV virions were found to be positive in an anti-HCV 
antibody assay. The HCV isolates from these samples were named HCV/J1 and HCV/J7. The 
infectivity of the blood sample containing the J1 isolate was confirmed by a prospective study of blood 
transfusion recipients. Dr. Tohru Katayama from the Department of Surgery at the National Tokyo Chest 
Hospital collected blood from patients who had contracted post-transfusion non-A, non-B hepatitis. He also 
collected blood samples from the respective blood donors of these patients. Next, these samples were 
assayed for antibodies to the C100-3 HCV1 antigen (EPO Pub. No. 318,216), and blood from one of the 
donors was found to be positive. 

Isolation of the RNA from the blood samples began by pelleting virions in the blood sample by 
ultracentrifugation [Bradley, D.W., McCaustland, K.A., Cook, E.H., Schable, C.A., Ebert, J.W. and Maynard, 
J.E. (1985) Gastroenterology 88, 773-779]. RNA was then extracted from the pellet by the 
guanidinium-cesium chloride method [Maniatis, T., Fritsch, E.F., and Sambrook, J. (1982) "Molecular Cloning: 
A Laboratory Manual", Cold Spring Harbor Laboratory, Cold Spring Harbor] and further purified by 
phenol/chloroform extraction in the presence of urea [Berk, A.J., Lee, F., Harrison, T., Williams, J. and Sharp, 
P.A. (1979) Cell 17, 935-944]. 

Five pairs of synthetic oligonucleotide primers were designed from the C/E, E, E/NS1, NS3, and NS5 
domains of the nucleotide sequence of HCV1 to isolate fragments from the J1 and J7 genome. The first set 
of primers was to isolate the sequence from the core and some of the envelope domain. The second set of 
primers was to isolate the sequences in the envelope domain. The third set of primers was to isolate a 
fragment which overlapped the putative envelope and non-structural one, NS1, domains. The fourth and fifth 
sets of primers were used to isolate fragments from non-structural domains three and five, NS3 and NS5. 
The sequences for the various primers are shown below: 
The sequences of the primers for the C/E region were: 
21S 5' CGTGCCCCCGCAAGACTGCT 3' 
J80A 5' CCGTCCTCCAGAAGCCGGAC 3' 
The sequences of the primers for the E region were: 
71S 5' GCCGAGGTCATGGGGTACAT 3' 
J132A 5' AACTGCGACACCACTAAGGC 3' 
The sequences of the primers for the E/NS1 region were: 
127S 5' TGGCATGGGATATGATGATG 3' 
166A 5' TTGAACTTGTGGTGATAGAA 3' 
The sequences of the primers for the NS3 region were: 
464S 5' GGCTATACCGGCGACTTCGA 3' 
526A 5' GACATGCATGTCATGATGTA 3' 
The sequences of the primers for the NS5 region were: 
870S 5' GCTGGAAAGAGGGTCTACTA 3' 
917A 5' GTTCTTACTGCCCAGTTGAA 3' 

1 µg of the antisense primers, 166A, 526A, or 917A, was added to 10 units of reverse transcriptase 
(Biorad) to synthesize cDNA fragments from the isolated RNA as the template. The cDNA fragments were 
then amplified by a standard polymerase chain reaction [Saiki, R.K., Scharf, S., Faloona, F., Mullis, K.B., 
Horn, G.T., Erlich, H.A., and Arnheim, N. (1985) Science 230, 1350-1354] after 1 µg of the appropriate 
sense primer, 21S, 71S, 127S, 464S or 870S, was added. 
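
As a small illustration of how primers such as those listed above can be inspected, the Python sketch below computes the reverse complement, G+C fraction and a rough melting temperature for primer 21S. The helper names are hypothetical, and the "Wallace rule" (2°C per A/T, 4°C per G/C) is only a crude approximation intended for short oligonucleotides; it is not part of the described procedure.

```python
# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Sequence of the opposite strand, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rough Tm estimate in degrees Celsius: 2*(A+T) + 4*(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    primer_21s = "CGTGCCCCCGCAAGACTGCT"  # sense primer 21S from the list above
    print(len(primer_21s), gc_fraction(primer_21s), wallace_tm(primer_21s))
    print(reverse_complement(primer_21s))
```

The reverse complement is the sequence on the opposite strand to which a sense primer anneals; antisense primers are written as the reverse complement of the genomic (sense) sequence at their target site.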

The cDNA fragments amplified by the PCR method were gel isolated and cloned by blunt-end ligation 
into the SmaI site of pUC119 [Vieira, J. and Messing, J. (1987) Methods in Enzymology 153, 3-11] or into 
the SnaBI site of charomid SB, a derivative of the cloning vector charomid 9-42 [Saito, I. and Stark, G. 
(1986) Proc. Natl. Acad. Sci. USA 83:8664-8668]. Clones which contain the fragments of the five viral 
domains were successfully constructed. 



II 



From the PCR reaction of the Japanese isolates, J1 and J7, three independent clones from each region, 
C/E, E, E/NS1, NS3, and NS5, have been sequenced by the dideoxy chain termination method. 

Sequence from all regions except C/E has been isolated from the J1 isolate. Sequence from only the 
C/E region has been isolated from the J7 isolate. Surprisingly, fragments isolated from both isolates are 
neither longer nor shorter than what would be predicted from the HCV1 genome. However, there is 
heterogeneity between clones containing sequence from the same region. Consequently, a consensus 
sequence was constructed for each of the domains, C/E, E, E/NS1, NS3 and NS5, as shown respectively in 
Figures 1 through 5. These differences may be explained as artifacts which occur randomly during the PCR 
amplification [Saiki, R.K., Scharf, S., Faloona, F., Mullis, K.B., Horn, G.T., Erlich, H.A., and Arnheim, N. 
(1985) Science 230, 1350-1354]. Another explanation is that more than one virus genome is present in the 
plasma of a single healthy carrier and that these genomes are heterogeneous at the nucleotide level. 

To clarify this point, it was determined how many of these nucleotide differences would lead to amino 
acid changes, using the sequence from the NS3 domain of the J1 isolate as an example. Out of the five 
nucleotide differences, three fall on the third position of the amino acid codon and do not change the amino 
acid sequence. Both of the remaining two nucleotide changes fall on the first position of the amino acid 
codon and generate amino acid changes of threonine to alanine and proline to alanine, all of which are 
small, neutral amino acid residues. Similarly, when analyzing the nucleotide differences in other domains, 
many silent and conserved mutations are found. These results suggest that nucleotide sequences of the 
HCV genomes in the plasma of a single healthy donor are heterogeneous at the nucleotide level. 
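
The distinction drawn above between silent third-position differences and first-position differences that change the amino acid can be sketched in Python as follows. The codons used in the example are hypothetical illustrations chosen to reproduce a threonine-to-alanine and a proline-to-alanine change; they are not the actual J1 codons, which are shown only in the figures. The standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_change(ref_codon, alt_codon):
    """Return (codon_position, ref_aa, alt_aa, is_silent) for a one-base difference."""
    ref_codon, alt_codon = ref_codon.upper(), alt_codon.upper()
    if len(ref_codon) != 3 or len(alt_codon) != 3:
        raise ValueError("codons must be three bases long")
    differing = [i for i in range(3) if ref_codon[i] != alt_codon[i]]
    if len(differing) != 1:
        raise ValueError("codons must differ at exactly one position")
    ref_aa, alt_aa = CODON_TABLE[ref_codon], CODON_TABLE[alt_codon]
    return differing[0] + 1, ref_aa, alt_aa, ref_aa == alt_aa


if __name__ == "__main__":
    # Hypothetical codon pairs, for illustration only.
    print(classify_change("ACC", "GCC"))  # first position, Thr -> Ala
    print(classify_change("CCA", "GCA"))  # first position, Pro -> Ala
    print(classify_change("GCT", "GCC"))  # third position, silent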

In addition, once the consensus sequences for each of the fragments were compiled, each sequence 
was compared to the HCV1 isolate in Figures 6 through 10. In Figure 6 the fragment from the C/E region of 
the J7 isolate shows a 92.8%, 512/552, nucleotide and 97.4%, 150/154, amino acid homology to the HCV1 
isolate. The fragment from the E domain of J1 shows a slightly lower nucleotide and amino acid homology 
to HCV1 in Figure 7 of 76.2% and 82.9%, respectively. The fragment from the J1 isolate which overlaps the 
envelope and non-structural one domains shows the lowest homology to HCV1, as seen in Figure 8, where 
the J1 isolate has a 71.5% nucleotide homology and a 73.5% amino acid homology to HCV1. Figure 9 
shows a comparison of the fragment from the NS3 domain of J1 to HCV1. The homology between the 
nucleotide sequences is 79.8%, while the amino acid homology between the isolates is quite high, 92.2% 
or 179/194 amino acids. Figure 10 shows the homology between the NS5 sequences from J1 and HCV1. 
The sequences have an 84.3% nucleotide and 88.7% amino acid homology. 
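
The homology figures above are simple ratios of matching positions to compared positions. The Python sketch below shows one way such a percentage could be computed for two already-aligned sequences of equal length; the function name and the toy sequences are hypothetical, and no alignment step is performed.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of positions that match between two aligned, equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # Toy example: 9 of 10 positions match.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
    # Ratios quoted in the text, recomputed from the stated counts.
    print(round(100 * 512 / 552, 1))
    print(round(100 * 150 / 154, 1))
    print(round(100 * 179 / 194, 2))
```

Recomputing from the stated counts gives about 92.8% for 512/552 and 97.4% for 150/154; 179/194 works out to about 92.27%, which the text reports as 92.2%.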

The vectors described in the examples above were deposited with the Patent Microorganism Depository, 
Fermentation Institute, Agency of Industrial Science and Technology at 1-3, Higashi 1-chome, Tsukuba-chi, 
Ibaragiken 305, Japan, and will be maintained under the provisions of the Budapest Treaty. The 
accession numbers and dates of the deposit are listed below, on page 68. 

III 



An HCV/J1 clone, J1-1519, was isolated using essentially the techniques described above. 
However, the primers used in the isolation were J159S and 199A. The sequences of the oligomeric primers 
J159S and 199A, which follow, were based upon those in J1-1216 and in HCV1. 
J159S 5' ACT GCC CTG AAC TGC AAT GA 3' 
199A 5' AA1 CCA GTT GAG TTC ATC CA 3' 

Clone J1-1519 is comprised of an HCV cDNA sequence of 367 nucleotides which spans most of the 5'-half 
of the NS1 region and which overlaps the E-region clone, J1-1216, by 31 nucleotides. Three 
independent clones spanning this region were sequenced; the sequences in this region obtained from 
the three clones were identical. The sequence of the HCV cDNA in J1-1216 (shown in the figure as J1) and 
the amino acids encoded therein (shown above the nucleotide sequence) are shown in Figure 13. Figure 13 
also shows the sequence differences between J1-1216 and the comparable region of the prototype HCV1 
cDNA (indicated in the figure as PT), and the resulting changes in the encoded amino acids. The homology 
between the J1-1216 and HCV1 cDNA is approximately 70% at the nucleotide level, and about 75% at the 
amino acid level. 

A composite of the sequences from the putative core to NS1 region of the J1 isolate is shown in Figure 
14; also shown in the figure are the amino acids encoded in the J1 sequence. The variation from the HCV1 
prototype sequence is shown in the line below the J1 nucleotide sequence; the dashed lines indicate 
homologous sequences. The nonhomologous amino acid encoded in the HCV1 prototype sequence is 
shown below the HCV1 nucleotide sequence. 

Cloned material containing the J1-1519 HCV cDNA (psl-1519) has been maintained in DH5α, and 
deposited with the Patent Microorganism Depository. 



IV 



Several regions of the J1 isolate, including the C200-C100 region from the putative NS3-NS4 region 
(which encompasses the region encoding the 5-1-1 polypeptide in HCV1 (see EPO Pub. No. 318,216)) and 
the putative NS1-E region, were amplified using the PCR method. The C200-C100 region includes 
nucleotides 3799 to 5321 of the prototype HCV1. RNA was extracted as described above, except that 
extraction was with guanidinium thiocyanate in the presence of Proteinase K and sodium dodecylsulfate 
(SDS) (Maniatis (1982), supra). The RNA was transcribed into HCV cDNA by incubation in a 25 µl reaction 
comprised of 1 µM of each primer, 40 units of RNase inhibitor (RNASIN), 5 units of AMV reverse 
transcriptase, and salts and buffer necessary for the reaction. Amplification of a segment of the HCV cDNA 
from the designated region was performed utilizing pairs of synthetic oligomer 16-mer primers. PCR 
amplification was accomplished in three rounds (PCR I, PCR II, and PCR III). The second and third rounds of 
PCR amplification (PCR II) utilized different sets of PCR primers; the first PCR reaction was diluted 10-fold 
and multiple rounds of PCR amplification were carried out with the new primers, so that ultimately up to 
50% of the products of the first PCR reaction (PCR I) were reamplified. The primers used for the 
amplification of the regions were the following. These primers, with the exception of J1C200-3 which was 
derived from the J1 isolate sequence, were derived from the prototype HCV1 sequence. 



Primers for amplification of the "5-1-1" region from NS3-NS4. 
PCR I 

511/16A (sense, derived from nucleotides starting at number 1528 of HCV1) 
5' AAC AGG CTG CGT GGT C 3' 

511/16B (anti-sense, derived from nucleotides ending at 5260 of HCV1) 
5' AGT TGG TCT GGA CAG C 3' 
PCR II 

511/35A (sense, the HCV portion derived from nucleotides starting at number 5057 of HCV1; the 
restriction enzyme site is underlined) 
5' CTTGAATTC TCG TCT TGT COG GGA AGC CGG CAA TC 3' 

511/35B (anti-sense, the HCV portion derived from nucleotides ending at number 5233 of HCV1; the 
restriction enzyme site is underlined) 
5' CTTGAATTC CCT CTG CCT GAC GGG ACG CGG TCT GC 3' 
PCR III 

511/35A (see supra) 

VSNrc7 (antisense, derived from nucleotides ending at number 5804 of HCV1) 
5' GTA GTG CGT GGG GGA AAC AT 3' 
Primers for amplification of the "NS1-E" region 
PCR I 

J1(E2)3 (sense, the HCV portion derived from nucleotides starting at number 953 of HCV1; the restriction 
enzyme site is underlined) 
5' CTTAGAATTC TGG CAT GGG ATA TGA TGA TG 3' 

J1(E)4 (sense, the HCV portion derived from nucleotides starting at number 1087 of HCV1; the restriction 
enzyme site is underlined) 
5' CTTAGAATTC TCC ATG GTG GGG AAC TGG GC 3' 

J1rc12 (anti-sense, the HCV portion derived from nucleotides ending at 1995 of HCV1; the restriction 
enzyme site is underlined) 
5' CTTAGAATTC CGT CCA GTT GCA GGC AGC TTC 3' 
PCR II 

J1rc13 (see supra) 

J1IZ-1 (sense, the HCV portion is derived from nucleotides starting at number 1841 of HCV1; the 
restriction enzyme site is underlined) 
5' CTTGAATTC CAA CTG GTT CGG CTG TAC A 3' 

J1IZ-2 (sense, the HCV portion is derived from nucleotides starting at number 1596 of HCV1; the 
restriction enzyme site is underlined) 
5' TGA GAC GGA CGT GCT GCT CCT 3' 

Primers for the C200-C100 region of the "NS3-NS4" region 



PCRl 

J1C200-1 (sense, derived from nucleotides starting at number 3478 of HCVl) 
5 TCC TAC TTG AAA GGC TC 3' 

J1C200-3 (anti-sense, derived from nucleotides ending at number 4402 of HCVl) 
5' GGA TCC AAG CTG AAA TCG AC 3' 

J1rc52 (anti-sense, the HCV portion derived from nucleotides ending at 5853 of HCV1, the restriction 
enzyme site is underlined) 

5' CTTAGAATTC GAG GCT GCT GAG ATA GGC AGT 3, 511'16A (see above). 
PCR II 

J1C200-2 (sense, the HCV portion derived from nucleotides starting at number 3557 of HCV1; the
restriction enzyme site is underlined)
5' CTTGAATTC CCC GTG GAG TGG CTA AGG CGG TGG ACT 3'

J1C200-4 (anti-sense, the HCV portion derived from nucleotides ending at 4346 of HCV1, the restriction
enzyme site is underlined)

5' CTTGAATTC TCG AAG TCG CCG GTA TAG CCG GTC ATG 3'

511/35A (see above)

J1rc51 (anti-sense, the HCV portion derived from nucleotides ending at 5826 of HCV1; the restriction
enzyme site is underlined)

5' CTTAGAATTC GGC AGC TGC ATC GCT CTC CGG CAC 3'
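
Because the underlining that marked the restriction enzyme sites does not survive in plain text, the sites can be located again by searching each primer for a recognition sequence. The following is a minimal sketch, assuming that the underlined site in these primers is the EcoRI recognition sequence GAATTC (an assumption suggested by the CTTAGAATTC and CTTGAATTC prefixes, not stated in the primer list). The helper names `clean` and `site_offset` are illustrative only.

```python
# Recognition sequence of EcoRI; assumed here to be the underlined site.
ECORI_SITE = "GAATTC"


def clean(primer):
    """Remove the spacing used in the printed primer and upper-case it."""
    return "".join(primer.split()).upper()


def site_offset(primer, site=ECORI_SITE):
    """Return the 0-based offset of the first site in the primer, or -1 if absent."""
    return clean(primer).find(site)


# Primer sequences copied from the list above (5' to 3').
PRIMERS = {
    "J1(E2)3": "CTTAGAATTC TGG CAT GGG ATA TGA TGA TG",
    "J1rc12": "CTTAGAATTC CGT CCA GTT GCA GGC AGC TTC",
    "J1C200-2": "CTTGAATTC CCC GTG GAG TGG CTA AGG CGG TGG ACT",
    "J1C200-1": "TCC TAC TTG AAA GGC TC",
}

for name, seq in PRIMERS.items():
    offset = site_offset(seq)
    if offset >= 0:
        print(f"{name}: GAATTC found at offset {offset}")
    else:
        print(f"{name}: no GAATTC site")
```

Primers described without a restriction enzyme site, such as J1C200-1, are expected to report no site under this assumption.
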
The amplified HCV cDNAs were either sequenced directly without cloning, and/or were cloned.
Sequencing was accomplished using an asymmetric PCR technique, essentially as described in Shyamala
and Ames, J. Bacteriology 171:1602 (1989). In this technique, amplification of the cDNA is carried out with
a limiting concentration of one of the primers (usually in a ratio of about 1:50) in order to get preferential
amplification of one strand. The preferentially amplified strand is then sequenced by the dideoxy chain
termination method.

The primers used for asymmetric sequencing by the PCR method were the following. For the NS1
region: J1IZ-1 and J1rc13 (sequenced with both); J1IZ-2, J1rc13 (confirmed on both strands). For the
NS3-NS4 region, which includes the C200-C100 N-terminal region, C200-C100 C-terminal region, and the 5-1-1
region: J1C200-2 and J1C200-7 (for the N-terminal region of C200-C100), and J1C200-4 and J1C200-6 (for
the C200-C100 C-terminal region); and 511/35A and hep 4 (for the 5-1-1 region). The sequences for
J1C200-2, J1C200-4, and 511/35A are shown supra; the sequences of hep 4, J1C200-6, and J1C200-7 are the
following.

hep 4 (derived from nucleotides starting at number 541-5 of HCV1)

5' GGC TAG TGG TTA GTG GGC TGG TGA CAG 3'
J1C200-6 (the HCV portion derived from nucleotides starting at number 3875 of HCV1, the restriction
enzyme site is underlined)

5' [...] ACT CCA CCT ACG GCA AGT TCC TT 3'

J1C200-7 (the HCV portion derived from nucleotides starting at number 3946 of HCV1, the restriction
enzyme site is underlined)

[...] are shown in Figure 15 and Figure 16, respectively. In the figures, the amino acids encoded
in the J1 sequence are shown above the J1 nucleotide sequence. The differences between the J1
sequence and the HCV1 prototype nucleotide sequence are shown below the J1 sequence (the dashes
indicate homologous nucleotides in both sequences). The encoded amino acids which differ in the HCV1 [...]

[...] the 5-1-1 region were cloned. A 300 bp [...]

The HCV cDNA fragments may be removed with SacI and [...].

[...] was ligated into pM1E in HB101. 12 clones were pooled
and deposited with the ATCC as AW-770bp-N; the HCV cDNA may be removed from the vector with [...].

cDNA from J1 equivalent to the 5-1-1 region of HCV1 was cloned into mp19 R1 [...]
DH5α-F'. Several M13 phage supernatants from this cloning were pooled and deposited with the ATCC
[...] on September [...] 1990. The HCV cDNAs may be obtained from the phage by treatment with
EcoRI. Accession numbers for J1 5-1-1 and AW-700bp-N or AW-700bp-C may be obtained by telephoning [...]

The above-described material was deposited with the American Type Culture Collection (ATCC).
An HCV cDNA library containing sequences of the putative "NS1" region of the J1 isolate was created
[...] from about nucleotide 1460 to about nucleotide [...]
of the HCV1 prototype nucleic acid sequence, where nucleotide 1 is the [...] of the initiating
methionine codon [...]

[...] in a defined orientation relative to the lacZ promoter in λ-phage. Using
this [...] the HCV cDNAs to J1 RNA were inserted into the NotI site of λ-gt22. The presence of NS1
sequences in the library was detected using as probe ALX54. [...] which [...]
sequence the portion of the "NS1" region were the following.



JHC 67 

rSACGC GGCCG CCTCC GTGTC CAGCG CQT 3 
JHC 68

5' CGTGC GGCCG CAAGA CTGCT AGCCG AGGT 3'
ALX 61

5' ACCTG CCACT GTGTA GTGGT CAGCA GTAAC 3'

ALX 62
5' ACGGA CGTCT TCGTC CTTAACAATA CCAGG 3'
ALX 54

S^AACT TTGCG ATCTG GAAGACAGGG ACAGG 3 

A 400 bp fragment of J1 HCV cDNA derived from the sequenced region was cloned into pGEM3Z and
maintained in HB101; the HCV cDNA may be removed from the vector with SacI and XbaI. Host cells
transformed with the vector (JH-400bp) have been deposited with the ATCC.

A pooled cDNA library was created from the J1 serum; the pooled library spans the J1 genome and is
identified as HCV-J1 λ gt22. The pooled cDNA library was created by pooling aliquots of 11 individual
cDNA libraries, which had been prepared using the directional cloning technique described above, except
that the libraries were created from primers which were designed to yield HCV cDNAs which spanned the
genome. The primers were derived from the sequence of HCV1, and included JHC 67 and JHC 68. The
HCV cDNAs were inserted into the NotI site of λ-gt22. The pooled cDNA library, HCV-J1 λ gt22, has been
deposited with the ATCC.



VI 



The sequence of a region of the polynucleotide upstream of that shown in Figure 14 was determined.
This region begins at nucleotide -267 with respect to the HCV-1 (See Figure 12) and extends for 560
nucleotides. Sequencing was accomplished by preparing HCV cDNA from RNA extracted from J1 serum,
and amplifying the HCV cDNA using the PCR method.

RNA was extracted from 100 µl of serum following treatment with proteinase K and sodium
dodecylsulfate (SDS). The samples were extracted with phenol-chloroform, and the RNA precipitated with ethanol.

HCV cDNA from the J1 isolate was prepared by denaturing the precipitated RNA with 0.01 M MeHgOH;
after ten minutes at room temperature, 2-mercaptoethanol was added to sequester the mercury ions.
Immediately, the mix for the first strand of cDNA synthesis was added, and incubation was continued for 1
hr at 37°C. The conditions for the synthesis of the anti-sense strand were the following: 50 mM Tris HCl,
pH 8.3, 75 mM KCl, 3 mM MgCl2, 10 mM dithiothreitol, 500 µM each deoxynucleotide triphosphate, 250
pmol specific antisense cDNA primer r25, 250 units MMLV reverse transcriptase. In order to synthesize the
second strand (sense), the synthesis reaction components were added, and incubated for one hour at
14°C. The components for the second strand reaction were as follows: 14 mM Tris HCl, pH 8.3, 68 mM
KCl, 7.5 mM ammonium sulfate, 3.5 mM MgCl2, 2.8 mM dithiothreitol, 25 units DNA polymerase I, and one
unit RNase H. The reactions were terminated by heating the samples to 95°C for 10 minutes, followed by
cooling on ice.

The HCV cDNA was amplified by two rounds of PCR. The first round was accomplished using 20 µl of
the cDNA mix. The conditions for the PCR reaction were as follows: 10 mM Tris HCl, pH 8.3, 50 mM KCl,
1.5 mM MgCl2, 0.002% gelatin, 200 µM each of the deoxynucleotide triphosphates, and 2.5 units
AmpliTaq. The PCR thermal cycle was as follows: 94°C one minute, 50°C one minute, 72°C one minute,
repeated 40 times followed by seven minutes at 72°C. The second round of PCR was accomplished using
nested primers (i.e., primers which bound to an internal region of the first round of PCR amplified product)
to increase the specificity of the PCR products. One percent of the first PCR reaction was amplified
essentially as the first round, except that the primers were substituted, and the second step in the PCR
reaction was at 60°C instead of 50°C. The primers used for the first round of PCR were ALX90 and r14.
The primers used for the second round of PCR were r14 and p14.
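
The two-round protocol above can be written down as a small data structure, which makes the differences between the rounds explicit. This is only an illustrative sketch: the class name `PcrRound` and its field names are hypothetical, the second round is assumed to keep the 40 cycles and the final extension of the first round (the text says only "essentially as the first round"), and the computed time counts programmed hold times only, not temperature ramping.

```python
from dataclasses import dataclass


@dataclass
class PcrRound:
    """One round of PCR as described in the text (hold times in minutes)."""
    primers: tuple
    anneal_c: int               # temperature of the second step of each cycle
    denature_c: int = 94
    extend_c: int = 72
    step_min: int = 1           # each of the three steps lasts one minute
    cycles: int = 40
    final_extension_min: int = 7

    def hold_minutes(self):
        """Total programmed hold time, ignoring ramp times."""
        return self.cycles * 3 * self.step_min + self.final_extension_min


# First round: outer primers, annealing step at 50 degrees C.
first_round = PcrRound(primers=("ALX90", "r14"), anneal_c=50)
# Second (nested) round: primers substituted, annealing step at 60 degrees C.
second_round = PcrRound(primers=("r14", "p14"), anneal_c=60)

for label, rnd in (("first", first_round), ("second", second_round)):
    print(label, rnd.primers, f"{rnd.anneal_c} C anneal,", rnd.hold_minutes(), "min")
```
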

The sequences of the primers for the synthesis of HCV cDNA and for the PCR method were the
following.
r25

5' ACC TTA CCC AAA TTG CGC GAC CTA 3'
ALX90

5' CCA TGA ATC ACT CCC CTG TGA GGA ACT A 3'
r14

5' GGG CCC CCAG CTA GGC CGA GA 3'
p14

5' AAC TAC TGT CTT CAC GCA GAA AGC 3'

The PCR products were gel purified, the material which migrated as having about 615 bp was isolated,
and sequenced by a modification of the Sanger dideoxy chain termination method, using 32P-ATP as label.
In the modified method, the sequence replication was primed using P32 and R31 as primers; the double
stranded DNA was melted for 3 minutes at 95°C prior to replication, and the synthesis of labeled dideoxy
terminated polynucleotides was catalyzed by Bst polymerase (obtained from BioRad Corp.), according to
the manufacturer's directions. The sequencing was performed using 500 ng to 1 µg of PCR product per
sequencing reaction.

The primers P32 (sense) and R31 (antisense) were derived from nucleotides -137 to -115 and from
nucleotides 192 to 173, respectively, of the HCV1 sequence. The sequences of the primers are the
following.
P32 primer

5' AAC CCG CTC AAT GCC TGG AGA TT 3'
R31 primer

5' GGC CGX CGA GCC TTG GGG AT 3'
where X = A or G
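
The stated coordinates can be checked against the primer lengths, and the degenerate position X in R31 can be expanded into the individual sequences it stands for. The sketch below is illustrative only: the function names are hypothetical, and the P32 sequence is entered with G at the position that is hard to read in the scanned text (an assumption that does not affect its length).

```python
from itertools import product


def clean(primer):
    """Remove the spacing used in the printed primer and upper-case it."""
    return "".join(primer.split()).upper()


def span_length(start, end):
    """Number of nucleotides in an inclusive coordinate range.

    Valid when both coordinates lie on the same side of the origin,
    as is the case for both primers here.
    """
    return abs(end - start) + 1


def expand(primer, codes):
    """List every plain sequence represented by a degenerate primer."""
    pools = [codes.get(base, base) for base in clean(primer)]
    return ["".join(combo) for combo in product(*pools)]


P32 = "AAC CCG CTC AAT GCC TGG AGA TT"   # sense, nucleotides -137 to -115
R31 = "GGC CGX CGA GCC TTG GGG AT"       # antisense, nucleotides 192 to 173

print(len(clean(P32)), span_length(-137, -115))
print(len(clean(R31)), span_length(192, 173))
print(expand(R31, {"X": "AG"}))
```

Both primer lengths agree with the coordinate ranges given in the text, and R31 expands to two sequences, one with A and one with G at the degenerate position.
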

The sequence of the region in the J1 isolate which encompasses the 5'-untranslated region as well as a
part of the region of the putative "Core" is shown in Figure 18. In the figure, amino acids encoded in the J1
sequence are shown above the nucleotide sequence. The sequence of the prototype HCV1 is shown below
the J1 sequence; the dashes indicate sequence homology with J1. The differing amino acids encoded in
the HCV1 sequence are shown below the HCV1 sequence.
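
The dash convention used in the figure can be reproduced with a few lines of code: positions where the second sequence matches the first are printed as dashes, and only the differing bases are spelled out. This is a minimal sketch on hypothetical toy sequences (not taken from the figures), assuming the two sequences are already aligned and of equal length.

```python
def dash_compare(reference, other):
    """Return `other` with every position identical to `reference` shown as '-'."""
    if len(reference) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    return "".join("-" if a == b else b for a, b in zip(reference, other))


# Hypothetical 12-nucleotide example, for illustration only.
isolate = "ATGAGCACAAAT"
prototype = "ATGAGCACGAAT"

print(isolate)
print(dash_compare(isolate, prototype))
```
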

An HCV cDNA fragment which is a representative of the 600 bp J1 sequence described above
(TC-600bp) was cloned into pGEM3Z and maintained in host HB101; the HCV cDNA fragment may be removed
with SacI and XbaI. This material is on deposit with the ATCC.
Patent Microorganism Depository-deposited under Budapest
Treaty terms.

Deposited Materials          Accession Number    Deposit Date

E. coli DH5/pS1-8791a        BP-2593             9/15/1989
(This clone contains 427 bp of the NS5 domain of J1)

E. coli HB101/pU1-1216c      BP-2594             9/15/1989
(This clone contains 351 bp of the E/NS1 domains of J1)

E. coli HB101/pU1-4652d      BP-2595             9/15/1989
(This clone contains 583 bp of the NS3 domain of J1)

E. coli DH5α/pS1-713c        BP-2637             11/1/1989
(This clone contains 580 bp of the E domain of J1)

E. coli DH5α/pS7-28c         BP-2638             11/1/1989
(This clone contains 552 bp of the C/E domain of J7)

E. coli DH5α/pS1-1519        BP-3081             8/30/90

The following vectors described in the Examples were deposited with the American Type Culture
Collection (ATCC), 12301 Parklawn Dr., Rockville, Maryland 20852, and have been assigned the following
Accession Numbers. The deposits were made under the terms of the Budapest Treaty.
Deposited Materials                        Accession Number    Deposit Date

TC-600bp (in E. coli HB101/pGEM3Z)         68393               9/11/90
JH-400bp (in E. coli HB101/pGEM3Z)         68394               9/11/90
AW-300bp (in E. coli HB101/pGEM3Z)         68392               9/11/90
AW-770bp-N (in E. coli HB101/pM1E)         68395               9/11/90
AW-700bp-C or AW-700bp-N
(in E. coli DH5α-F'/M13mp10)
HCV-J1 λ gt22                              40884               9/6/90



These deposits are provided for the convenience of those skilled in the art. These deposits are neither an
admission that such deposits are required to practice the present invention nor that equivalent embodiments
are not within the skill of the art in view of the present disclosure. The public availability of these deposits is
not a grant of a license to make, use or sell the deposited materials under this or any other patent. The
nucleic acid sequences of the deposited materials are incorporated into the present disclosure by reference,
and are controlling if in conflict with any sequences described herein.

While the present invention has been described by way of specific examples for the benefit of those in the
field, the scope of the invention is not limited as additional embodiments will be apparent to those of skill in
the art from the present disclosure.



Claims 


1. A DNA molecule comprising a nucleotide sequence of at least 15 bp from an HCV isolate substantially
homologous to an isolate selected from the group J1 and J7, wherein said nucleotide sequence is distinct
from the nucleotide sequence of HCV isolate HCV1.

2. A DNA molecule comprising a nucleotide sequence of at least 15 bp encoding an amino acid sequence
from the HCV isolate J1 or J7 wherein the J1 or J7 amino acid sequence is distinct from the amino acid
sequence of HCV isolate HCV1.

3. A DNA molecule according to claim 2 wherein the J1 or J7 amino acid sequence comprises a 
substantially complete viral polypeptide. 

4. A DNA molecule according to claim 2 wherein the J7 amino acid sequence is amino acid 1 to amino acid
115.

5. A DNA molecule according to claim 2 wherein the J1 amino acid sequence is from amino acid 116 to 
amino acid 350. 

6. A DNA molecule according to claim 2 wherein the J1 amino acid sequence is from amino acid 351 to 
amino acid 651. 

7. A DNA molecule according to claim 2 wherein the J1 amino acid sequence is from amino acid 1007 to
amino acid 1650.

8. A DNA molecule according to claim 2 wherein the J1 amino acid sequence is from amino acid 2100 to
the end of the coding sequence.

9. A purified polypeptide comprising an amino acid sequence from an HCV isolate substantially homologous
to an isolate selected from the group consisting of J1 and J7 wherein the amino acid sequence is distinct
from the sequence of the polypeptides encoded by the HCV isolate HCV1.

10. A purified polypeptide according to claim 9 wherein the J1 or J7 amino acid sequence comprises an 
epitope that is not immunologically cross-reactive with any HCV1 epitope. 
11. A purified polypeptide according to claim 9 wherein the J7 amino acid sequence is from amino acid 1 to
amino acid 115.

12. A purified polypeptide according to claim 9 wherein the J1 amino acid sequence is from amino acid 116
to amino acid 350.

13. A purified polypeptide according to claim 9 wherein the J1 amino acid sequence is from amino acid 351
to amino acid 651.

14. A purified polypeptide according to claim 9 wherein the J1 amino acid sequence is from amino acid
1007 to amino acid 1650.

15. A purified polypeptide according to claim 9 wherein the J1 amino acid sequence is from amino acid
2100 to the end of the coding sequence.

16. A polypeptide comprising an amino acid sequence from an HCV isolate selected from the group
consisting of J1 and J7 wherein the J1 or J7 amino acid sequence is distinct from the amino acid sequence
of HCV isolate HCV1 and the polypeptide is immobilized on a solid support.

17. An immunoassay for detecting the presence of anti-HCV antibodies in a test sample comprising:

(a) incubating the test sample under conditions that allow the formation of antigen-antibody complexes
with an antigenic polypeptide comprising an amino acid sequence from an HCV isolate selected from the
group consisting of J1 and J7, wherein the amino acid sequence is distinct from the amino acid
sequence of HCV isolate HCV1; and (b) the J1 or J7 amino acid sequence is not immunologically
cross-reactive with HCV1; and

(b) detecting an antigen-antibody complex comprising the antigenic polypeptide.

18. An immunoassay according to claim 17 wherein the J1 amino acid sequence is from a viral domain
selected from the group consisting of amino acid sequence from amino acid 116 to amino acid 350, from
amino acid 351 to amino acid 651, from amino acid 1007 to amino acid 1650, and from amino acid 2100 to
the end of the coding sequence.

19. An immunoassay according to claim 17 wherein the J1 amino acid sequence is from amino acid 1 to
amino acid 115.

20. An immunoassay according to claim 17 wherein the test sample comprises human blood or a fraction
thereof.

21. A composition comprising anti-HCV antibodies that bind an HCV epitope substantially free of antibodies
that do not bind an HCV epitope, wherein:

(a) the HCV epitope comprises an amino acid sequence from an HCV isolate selected from the group
consisting of J1 and J7;

(b) the J1 or J7 amino acid sequence is distinct from the amino acid sequence of HCV isolate HCV1; and

(c) the J1 or J7 amino acid sequence is not immunologically cross-reactive with HCV1.

22. A composition according to claim 21 wherein the anti-HCV antibodies are polyclonal.

23. A composition according to claim 21 wherein the anti-HCV antibodies are monoclonal.

24. An immunoassay for detecting the presence of an HCV polypeptide in a test sample comprising:

(a) incubating the test sample under conditions that allow the formation of antigen-antibody complexes
with anti-HCV antibodies that bind an HCV epitope wherein:

(i) the HCV epitope comprises an amino acid sequence from an HCV isolate selected from the group
consisting of J1 and J7;

(ii) the J1 or J7 amino acid sequence is distinct from the amino acid sequence of HCV isolate HCV1;

(iii) the J1 or J7 amino acid sequence is not immunologically cross-reactive with HCV1; and (b)
detecting an antigen-antibody complex comprising the anti-HCV antibodies.

25. An immunoassay according to claim 24 wherein the J1 amino acid sequence is from a viral domain
selected from the group consisting of amino acid sequence from amino acid 116 to amino acid 350, from
amino acid 351 to amino acid 651, from amino acid 1007 to amino acid 1650, and from amino acid 2100 to
the end of the coding sequence.

26. An immunoassay according to claim 24 wherein the J7 amino acid sequence is from amino acid 1 to
amino acid 115.

27. A method of producing anti-HCV antibodies comprising administering to a mammal a polypeptide
comprising an amino acid sequence from an HCV isolate selected from the group consisting of J1 and J7
wherein the J1 or J7 amino acid sequence is distinct from the amino acid sequence of HCV isolate HCV1
whereby the mammal produces anti-HCV antibodies.

28. A method of detecting HCV polynucleotides in a test sample comprising:

(a) providing a probe comprising the DNA molecule of claim 1;

(b) contacting the test sample and the probe under conditions that allow for the formation of a
polynucleotide duplex between the probe and its complement in the absence of substantial
polynucleotide duplex formation between the probe and non-HCV polynucleotide sequences present in the
test sample; and

(c) detecting any polynucleotide duplexes comprising the probe.

29. A method of producing a recombinant polypeptide comprising an HCV amino acid sequence, the
method comprising:

(a) providing host cells transformed by a DNA construct comprising a control sequence for the host cell
operably linked to a coding sequence encoding an
amino acid sequence from an HCV isolate selected from the group comprised of J1 and J7 wherein the
J1 or J7 amino acid sequence is distinct from the amino acid sequence of HCV isolate HCV1;

(b) growing the host cells under conditions whereby the coding sequence is transcribed and translated
into the recombinant polypeptide; and

(c) recovering the recombinant polypeptide.

30. A biological material derived from the group consisting of materials deposited under Accession
Numbers BP-2593, BP-2594, BP-2595, BP-2637, BP-2638, BP-3081, ATCC No. 68392, ATCC No. 68393,
ATCC No. 68394, ATCC No. 68395, and ATCC No. 40884.
J7     1  AGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGT
discrepancy
clone
altered aa

J7    37  ACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGG

                             Met Ser Thr Asn
J7    73  TCTCGTAGACCGTGCATC ATG AGC ACA AAT

          Pro Lys Pro Gln Arg Lys Thr Lys Arg
J7   103  CCT AAA CCC CAA AGA AAA ACC AAA CGT
          T G
          b 1
          Arg

          Asn Thr Asn Arg Arg Pro Gln Asp Val
J7   130  AAC ACC AAC CGT CGC CCA CAG GAC GTT
          b

          Lys Phe Pro Gly Gly Gly Gln Ile Val
J7   157  AAG TTC CCG GGC GGT GGT CAG ATC GTC
          1 °
          Leu

          Gly Gly Val Tyr Leu Leu Pro Arg Arg
J7   184  GGT GGA GTT TAC TTG TTG CCG CGC AGG
          A
          b

          Gly Pro Arg Leu Gly Val Arg Ala Thr
J7   211  GGC CCC AGG TTG GGT GTG CGT GCG ACT

FIG. 1-1
          Arg Lys Thr Ser Glu Arg Ser Gln Pro
     238  AGG AAG ACT TCC GAG CGG TCG CAA CCT
          A
          b

          Arg Gly Arg Arg Gln Pro Ile Pro Lys
     265  CGT GGA AGG CGA CAA CCT ATC CCC AAG

          Ala Arg Arg Pro Glu Gly Arg Thr Trp
     292  GCT CGC CGG CCC GAG GGC AGG ACC TGG

          Ala Gln Pro Gly Tyr Pro Trp Pro Leu
     319  GCT CAG CCT GGG TAT CCT TGG CCC CTC

          Tyr Gly Asn Glu Gly Leu Gly Trp Ala
     346  TAT GGC AAT GAG GGC TTG GGG TGG GCA
          A
          b
          END

          Gly Trp Leu Leu Ser Pro Arg Gly Ser
     373  GGA TGG CTC CTG TCA CCC CGC GGC TCT

          Arg Pro Ser Trp Gly Pro Asn Asp Pro
     400  CGG CCT AGT TGG GGC CCC AAT GAC CCC
          T C
          c b
          Thr

          Arg Arg Arg Ser Arg Asn Leu Gly Lys
     427  CGG CGT AGG TCG CGT AAT TTG GGT AAG

          Val Ile Asp Thr Leu Thr Cys Gly Phe
     454  GTC ATC GAT ACC CTT ACA TGC GGC TTC

FIG. 1-2

Leu 
          Ala Asp Leu Met Gly Tyr Ile Pro Leu
J7   481  GCC GAC CTC ATG GGG TAC ATT CCG CTT
          c c
          c b

          Val Gly Ala Pro Leu Gly Gly Ala Ala
J7   508  GTC GGC GCC CCC TTA GGG GGC GCT GCC

          Arg Ala Leu Ala His Gly
J7   535  AGG GCC CTG GCA CAT GGT

FIG. 1-3
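
The codon rows of the figure sheets can be checked mechanically against the amino acids printed above them, which is a convenient way to spot misread bases in a scanned copy. The following is a minimal sketch using the standard genetic code; the function name `translate` is illustrative, and the second example uses a deliberately misread codon (TAG in place of TAC, an assumed scanning error) to show how a stop codon under a printed amino acid stands out.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(codons):
    """Translate a spaced or unspaced DNA string into one-letter amino acids."""
    seq = "".join(codons.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# Row 103 of FIG. 1-1; the figure prints Pro Lys Pro Gln Arg Lys Thr Lys Arg.
print(translate("CCT AAA CCC CAA AGA AAA ACC AAA CGT"))

# Row 184 with an assumed misreading (TAG) and with the codon that encodes Tyr (TAC).
print(translate("GGT GGA GTT TAG TTG TTG CCG CGC AGG"))
print(translate("GGT GGA GTT TAC TTG TTG CCG CGC AGG"))
```
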
Pro Leu Val Gly Ala Pro Leu Gly Gly
J1 1 T CCG CTC GTC GGC GCC CCC TTA GGG GGC
discrepancy ^ 

^^^^^ ser 
altered aa 

Ala Ala Arg Ala Leu Ala His Gly Val 
29 GCT GCC AGO GCC CT6 GCA CAT GGT GTC 

Arg Val Leu Glu Asp Gly Val Asn Tyr 
56 CGG GTT CTG GAG GAC GGC GTG AAC TAT 



83 



Ala Thr Gly Asn Leu Pro Gly Cys Ser 
GCA ACA GGG AAT TTG CCC GGT TGC TCT 



Phe ser He Phe Leu Leu Ala Leu Leu 
110 TTC TCT ATC TTC CTC TTG GCT CTG CTG 

A T 
g d 



ser cys Leu Thr He Pro Ala Ser Ala 
137 TCC TGT TTG ACC ATC CCA GCT TCC GCT 

Tyr Glu Val Arg Asn Val Ser Gly He 
164 TAT GAA GTG CGC AAC GTG TCC GGG ATA 

Tyr His Val Thr Asn Asp Cys Ser Asn 
191 TAC CAT GTC ACA AAC GAC TGC TCC AAC 

T 
d 



Ser Ser He Val Tyr Glu Ala Ala Asp 
218 TCA AGC ATT GTG TAT GAG GCG GCG GAC 

FIG 2-1 






EP 0 419 182 A1 



245 



Val He Met His Ala Pro Gly Cys Val 
GTG ATC ATG CAT GCC CCC GGG TGC GTG 



272 



Pro Cys Val Arg Glu Asn Asn Ser Ser 
CCC TGC GTT CGG GAG AAC AAT TCC TCC 

C 

d 



299 



Arg Cys Trp Val Ala Leu Thr Pro Thr 
CGT TGC TGG GTA GC6 CTC ACT CCC ACG 



326 



Leu Ala Ala Arg Asn Ala Ser Val Pro 
CTC GCG GCC AGG AAT GCC AGC GTC CCC 



353 



Thr Thr Thr Leu Arg Arg His Val Asp 
ACT ACG ACA TTA CGA CGC CAC GTC GAC 

G 

d 



380 



Leu Leu Val Gly Thr Ala Ala Phe Cys 
TTG CTC GTT GGG ACG GCT GCT TTC TGC 



407 



Ser Ala Met Tyr Val Gly Asp Leu Cys 
TCC GCT ATG TAG GTG GGG GAT CTC TGC 



Gly Ser Val Phe Leu He Ser Gin Leu 

GGA TCT GTT TTC CTC ATC TCC CAG CTG 

T 
d 



461 



Phe Thr Phe Ser Pro Arg Arg His Glu 
TTC ACC TTC TCG CCT CGC CGG CAT GAG 

FIG. 2-2 
          Thr Val Gln Asp Cys Asn Cys Ser Ile
     488  ACA GTA CAG GAC TGC AAC TGC TCA ATC

          Tyr Pro Gly His Val Ser Gly His Arg
     515  TAT CCC GGC CAC GTA TCA GGC CAT CGC
          c

          Met Ala Trp Asp Met Met Met Asn Trp
     542  ATG GCT TGG GAT ATG ATG ATG AAC TGG

          Ser Pro Thr Ala
     569  TCG CCC ACG GCA

FIG. 2-3
Asn Trp Ser Pro Thr
1 AAC TGG TCG CCC ACG 

discrepancy 
clone 
altered aa 



Jl 16 



Ala Ala Leu Val Val Ser Gin Leu Leu 
GCA GCC TTA GTG GTG TCG GAG TTA CTC 



I I i 



Jl 



43 



Arg He Pro Gin Ala Val Met Asp Met 
CGG ATC CCA CAA 6CT GTC ATG GAC ATG 



Val Ala Gly Ala His Trp Gly Val Leu 
GTG GCG GGG GCC CAC TGG GGA GTC CTA 



Ala Gly Leu Ala Tyr Tyr Ser Met Val 
GCG GGC CTT GCC TAC TAT TCC ATG GTG 

A 
i 



Jl 124 



Gly Asn Trp Ala Lys Val Leu He Val 
GGG AAC TGG GCT AAG GTT TTG ATT GTG 



Jl 151 



Met Leu Leu Phe Ala Gly Val Asp Gly 
ATG CTA CTC TTT GCC GGC GTT GAC GGG 



Jl 178 



His Thr Arg Val Thr Gly Gly Val Gin 
CAT ACC CGC GTG ACG GGG GGG GTG CAA 
AG A 
gg 1 
Ser 



FIG. 3-1 
J1 205



Glv His Val Thr Ser Thr Leu Thr Ser 
GGC CAC GTC ACC TCT ACA CTC ACG TCC 
T G 

c i 
Ala 



Jl 232 



Leu Phe Arg Pro Gly Ala ser Gin Lys 

CTC TTT AGA CCT GGG GCG TCC CAG AAA 



Jl 259 



He Gin Leu Val Asn Thr Asn Gly Ser 
ATT CAG CTT GTA AAC ACC AAT GGC AGT 

TC T 

ii i 

Ser Leu 



Jl 286 



Trp His He Asn Arg Thr Ala Leu Asn 
TGG CAT ATC AAC AGG ACT GCC CTG AAC 

g 



Jl 313 



Cys Asn Asp Ser Leu Gin Thr Gly Phe 
TGC AAT GAC TCC CTC CAA ACT GGG TTC 



Jl 340 



Leu Ala Ala Leu 
CTT GCC GCG CTG 

FIG. 3-2 
Ser

Jl 1 C TCA 

discrepancy 

clone 

altered aa 



Val He Asp Cys Asn Thr Cys Val Thr 
5 GTG ATC GAC TGT AAC ACA TGT GTC ACT 

Gin Thr Val Asp Phe Ser Leu Asp Pro 
32 GAG ACG GTC GAT TTC AGC TTG GAT CCC 

Thr Phe Thr He Glu Thr Thr Thr Val 
59 ACC TTC ACC ATC GAG ACG ACG ACC GTG 
G 
c 

Ala 

Pro Gin Asp Ala Val Ser Arg Thr Gin 
86 CCC CAA GAT GCG GTT TCG CGC ACG CAG 

Arg Arg Gly Arg Thr Gly Arg Gly Arg 
113 CGG CGA GGT AGG ACT GGC AGG GGC AGG 

Arg Gly He Tyr Arg Phe Val Thr Pro 
140 AGA GGC ATC TAT AGG TTT GTG ACT CCA 

Gly Glu Arg Pro Ser Ala Met Phe Asp 
167 GGA GAA CGG CCC TCG GCG ATG TTC GAT 

Ser Ser Val Leu Cys Glu Cys Tyr Asp 
194 TCT TCG GTC CTA TGT GAG TGT TAT GAC 

Ala Gly cys Ala Trp Tyr Glu Leu Thr 
221 GCG GGC TGT GCT TGG TAT GAG CTC ACG 
A 
e 

Gly(=) 

FIG. 4-1 
J1



Pro Ala Gin Thr Ser Val Arg Leu Arg 
248 CCC GCT GAG ACC TCG GTT AGG TTG CGG 



Jl 



Ala Tyr Leu Asn Thr Pro Gly Leu Pro 
275 GCT TAG CTA AAT ACA CCA GGG TTG CCC 



Jl 



Val Cys Gin Asp His Leu Glu Phe Trp 
302 GTC TGC CAG GAC CAT CTG GAG TTC TGG 



Jl 



Glu Ser Val Phe Thr Gly Leu Thr His 
329 GAG AGC GTC TTC ACA GGC CTC ACC CAC 



Jl 



lie Asp Ala His Phe Leu Ser Gin Thr 
3 56 ATA GAC GCC CAC TTC TTG TCC CAG ACT 



Jl 



Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
383 AAG CAG GCA GGA GAC AAC TTC CCC TAC 



Jl 



Leu Val Ala Tyr Gin Ala Thr Val Cys 
410 CTG GTA GCA TAC CAA GCC ACA GTG TGC 



Ala Arg Ala Lys Ala Pro Pro Pro Ser 
Jl 437 GCC AGG GCT AAG GCT CCA CCT CCA TCG 

C 

Ala(=) 



Jl 



Trp Asp Gin Met Trp Lys Cys Leu He 
464 TGG GAT CAA ATG TGG AAG TGT CTC ATA 



Arg Leu Lys Pro Thr Leu His Gly Pro 
Jl 491 CGG CTA AAG CCT ACG CTG CAC GGG CCA 

G 
e 
Ala 



FIG. 4-2 
Thr Pro Leu Leu Tyr Arg Leu Gly Ala
Jl 518 ACG CCC CTG CTG TAT AGG CTA GGA GCC 

A 
e 

Arg(=) 

Val Gin Asn Glu Val Thr Leu Thr His 
Jl 545 GTC CAG AAT GAG GTC ACC CTC ACA CAC 

Pro lie Thr Lys 
Jl 572 CCT ATA ACC AAA 

FIG. 4-3 
Leu Thr
1 C CTC ACC 

discrepancy 
clone 
altered aa 



8 



Ara ASP Pro Thr Val Pro Leu Ala Arg 
gIc CC? ACC GTC CCC CTT GCG CGG 



Ala Ala Trp Glu Thr Ala Arg His Thr 
35 GOT GCG TGG GAG ACA GCT AGA CAC ACT 



g 

Thr(=) 



Pro Val Asn Ser Trp I-eu Gly Asn lie 
62 CCA GTC AAC TCC TGG CTA GGC AAC ATC 

He Met Tyr Ala Pro Thr Leu Trp Ala 
89 ATC ATG TAT GCG CCC ACT TTG TGG GCA 
T 

g 

Ile(=) 

Ara Met He Leu Met Thr His Phe Phe 
116 AGG ATG ATT CTG ATG ACT CAC TTC TTC 

ser He Leu Leu Ala Gin Glu Gin Leu 
143 TCC ATC CTT CTA GCC CAG GAG CAA CTT 



Glu Lys Ala Leu Asp Cys Gin He Tyr 
170 GAA AAA GCC CTG GAT TGT CAA ATC TAG 

Glv Ala Cys Tyr Ser He Glu Pro Leu 
197 GGG GCC TGT TAC TCC ATT GAG CCA CTT 

FIG. 5-1 
J1



Asp Leu Pro Gin lie He Glu Arg Leu 
224 GAC CTA OCT GAG ATC ATT GAA CGA CTC 



Jl 



His Gly Leu Ser Ala Phe Ser Leu His 
251 CAT GGT CTT AGC GCA TTT TCA CTC CAT 



Jl 



Ser Tyr Ser Pro Gly Glu lie Asn Arg 
278 AGT TAC TCT CCA GGT GAG ATC AAT AGG 



Jl 



Val Ala Ser Cys Leu Arg Lys Leu Gly 
305 GTG GCT TCA TGC CTC AGG AAG CTT GGG 



Jl 



Val Pro Pro Leu Arg Val Trp Arg His 
332 GTA CCA CCC TTG CGA GTC TGG AGA CAT 



Jl 



Arg Ala Arg Ser Val Arg Ala Lys Leu 
359 CGG GCC AGA AGT GTC CGC GCT AAG CTA 



Leu Ser Gin Gly Gly Arg Ala Ala Thr 
Jl 386 CTG TCC CAA GGG GGG AGG GCC GCC ACT 

G 

g 

Gln(=) 



Lys Gly Lys Tyr Leu 
Jl 413 TGT GGC AAG TAC CTC 

FIG. 5-2 
J7 1 AGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGT
HCVl 

J7 37 ACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAGG 
HCVl 

Met Ser Thr Asn 

J7 73 TCTCGTAGACCGTGCATC ATG AGC ACA AAT 
HCVl C ^ 



Pro Lys Pro Gin Arg Lys Thr Lys Arg 
J7 103 CCT AAA CCC CAA AGA AAA ACC AAA CGT 
HCVl T A A 

Lys Asn 
*** 



Asn Thr Asn Arg Arg Pro Gin Asp Val 
J7 130 AAC ACC AAC CGT CGC CCA CAG GAC GTT 
HCVl ^ 

Lys Phe Pro Gly Gly Gly Gin lie Val 
J7 157 AAG TTC CCG GGC GGT GGT CAG ATC GTC 
HCVl T C T 

Gly Gly Val Tyr Leu Leu Pro Arg Arg 
J7 184 GGT GGA GTT TAC TTG TTG CCG CGC AGG 
HCVl 

Gly Pro Arg Leu Gly Val Arg Ala Thr 
J7 211 GGC CCC AGG TTG GGT GTG CGT GCG ACT 
HCVl T A C G 



Arg Lys Thr Ser Glu Arg Ser Gin Pro 
J7 238 AGG AAG ACT TCC GAG CGG TCG CAA CCT 
HCVl A 

FIG. 6-1 
Arg Gly Arg Arg Gln Pro Ile Pro Lys
J7 265 CGT GGA AGG CGA CAA CCT ATC CCC AAG 
HCVl A T A T G 

Ala Arg Arg Pro Glu Gly Arg Thr Trp 
J7 292 GCT CGC CGG CCC GAG GGC AGG ACC TGG 
HCVl T 

Ala Gin Pro Gly Tyr Pro Trp Pro Leu 
J7 319 GCT CAG CCT GGG TAT CCT TGG CCC CTC 

C C 



HCVl 



HCVl 



Tyr Gly Asn Glu Gly Leu Gly Trp Ala 
J7 346 TAT GGC AAT GAG GGC TTG GGG TGG GCA 

GC G 



HCVl 



HCVl 



HCVl 



HCVl 



Cys 
*** 



Gly Trp Leu Leu Ser Pro Arg Gly Ser 
J7 373 GGA TGG CTC CTG TCA CCC CGC GGC TCT 

T T 



Arg Pro Ser Trp Gly Pro Asn Asp Pro 
J7 400 CGG CCT AGT TGG GGC CCC AAT GAC CCC 

C CA 

Thr 



Arg Arg Arg Ser Arg Asn Leu Gly Lys 
J7 427 CGG CGT AGG TCG CGT AAT TTG GGT AAG 

C 



Val He Asp Thr Leu Thr Cys Gly Phe 
J7 454 GTC ATC GAT ACC CTT ACA TGC GGC TTC 

G 



FIG. 6-2 
HCV1



Ala Asp Leu Met Gly Tyr lie Pro Leu 
J7 481 GCC GAC CTC ATG GGG TAG ATT CCG CTT 

A C 



HCVl 



Val Gly Ala Pro Leu Gly Gly Ala Ala 
J7 508 6TC GGC GCC CCC TTA GGG GGC GCT GCC 

T C T A 



HCVl 



Arg Ala Leu Ala His Gly 
J7 535 AGG GCC CTG GCA CAT GGT 

G C 



FIG. 6-3 
Pro Leu Val Gly Ala Pro Leu Gly Gly
T CCG CTC GTC GGC GCC CCC TTA GGG GGC 

T C T A 



A 



Ala Ala Arg Ala Leu Ala His Gly Val 
29 GCrr GCC AGG GCC CT6 GCA CAT GGT GTC 

G C 



Arg val Leu Glu Asp Gly Val Asn Tyr 
56 CGG GTT CTG GAG GAC GGG GTG AAC TAT 

A 



83 



Ala Thr Gly Asn Leu Pro Gly cys Ser 
GCA ACA GGG AAT TTG CCC GGT TGC TCT 

C C T T 



Phe ser He Phe Leu Leu Ala Leu Leu 
110 TTC TCT ATC TTC CTC TTG GCT CTG CTG 

T C C C 



ser cys Leu Thr He Pro Ala Ser Ala 
137 TCC TGT TTG ACC ATC CCA GCT TCC GCT 
XC TGGC GC 

Val 

Tvr Glu val Arg Asn Val Ser Gly He 
164 TAT GAA GTG CGC AAC GTG TCC GGG ATA 
Q Q TCC AG C T 

Gin Ser Thr Leu 

*** *** 



Tvr His Val Thr Asn Asp Cys Ser Asn 
191 TAG CAT GTC ACA AAC GAC TGC TCC AAC 
C c T T C T 

Pro 

FIG 7-1 
Ser Ser Ile Val Tyr Glu Ala Ala Asp
218 TCA AGC ATT GTG TAT GAG GCG GCG GAC 
G T C C T 



Val He Met His Ala Pro Gly Cys Val 

245 GTG ATC ATG CAT GCC CCC 6GG TGC GTG 

CC C C A T G C 
Ala Leu Thr 



Pro Cys Val Arg Glu Asn Asn Ser Ser 
272 CCC TGC GTT CGG GAG AAC AAT TCC TCC 
tj, T GG C G G 

Gly Ala 
*** 



Arg Cys Trp Val 
299 CGT TGC TGG GTA 
A G T G 



Ala Leu Thr Pro Thr 
GCG CTC ACT CCC ACG 

AG C T 

Met 



Leu Ala Ala Arg Asn Ala Ser Val Pro 
326 CTC GCG GCC AGG AAT GCC AGC GTC CCC 
GGCA G GAAC 

Val Thr Asp Gly Lys Leu 



*** 



Thr Thr Thr Leu Arg Arg His Val Asp 
353 ACT ACG ACA TTA CGA CGC CAC GTC GAC 
G G CAG C T T A T 

Ala Gin He 
*** 



Leu Leu Val Gly Thr Ala Ala Phe Cys 
380 TTG CTC GTT GGG ACG GCT GCT TTC TGC 
C T C GC C A C C T 

Ser Thr Leu 

FIG. 7-2 
Ser Ala Met Tyr Val Gly Asp Leu Cys
407 TCC GCT ATG TAC GTG GG6 GAT CTC TGC 
G C C C C A 

Leu 

Gly ser Val Phe Leu He Ser Gin Leu 
434 GGA TCT GTT TTC CTC ATC TCC CAG CTG 
G C T T G GG A 

Val Gly 



Phe Thr Phe Ser Pro Arg Arg His Glu 
461 TTC ACC TTC TCG CCT CGC CGG CAT GAG 

T CAG C C TG 

Trp 
*** 



Thr Val Gln Asp Cys Asn Cys Ser Ile

488 ACA GTA CAG GAC TGC AAC TGC TCA ATC 

G ACG A GT T T 

Thr Gly 
*** *** 



Tyr Pro Gly His Val Ser Gly His Arg 

515 TAT CCC GGC CAC GTA TCA GGC CAT CGC 

T A AG T C 
Ile Thr



Met Ala Trp Asp Met Met Met Asn Trp 
542 ATG GCT TGG GAT ATG ATG ATG AAC TGG 
A 



Ser Pro Thr Ala 
569 TCG CCC ACG GCA 
C T AG 
Thr 



FIG. 7-3 
Asn Trp Ser Pro Thr Ala
J1 1 AAC TGG TCG CCC ACG GCA
HCV1 C T AG

Thr



Ala Leu Val Val Ser Gln Leu Leu Arg
J1 19 GCC TTA GTG GTG TCG CAG TTA CTC CGG

HCV1



G G AA GT CG 
Met Ala 



Ile Pro Gln Ala Val Met Asp Met Val
J1 46 ATC CCA CAA GCT GTC ATG GAC ATG GTG

CAT AC
Ile Leu Ile

Ala Gly Ala His Trp Gly Val Leu Ala 
J1 73 GCG GGG GCC CAC TGG GGA GTC CTA GCG
HCV1 T T T G



Gly Leu Ala Tyr Tyr Ser Met Val Gly
J1 100 GGC CTT GCC TAC TAT TCC ATG GTG GGG

A A G T TC
Ile Phe



HCV1



HCV1



Asn Trp Ala Lys Val Leu Ile Val Met
J1 127 AAC TGG GCT AAG GTT TTG ATT GTG ATG

G C C G A C 

Val Leu 



Leu Leu Phe Ala Gly Val Asp Gly His 
J1 154 CTA CTC TTT GCC GGC GTT GAC GGG CAT



Thr Arg Val Thr Gly Gly Val Gln Gly
J1 181 ACC CGC GTG ACG GGG GGG GTG CAA GGC

HCV1 ACC A AGT GCC

*** ***

FIG. 8-1 
His Val Thr Ser Thr Leu Thr Ser Leu
J1 208 CAC GTC ACC TCT ACA CTC ACG TCC CTC
HCV1 ACT GTG GG T T GTT AG

Thr Val Gly Phe Val
*** *** *** 



Phe 

J1 235 TTT
HCV1 C C

Leu 



Arg Pro Gly Ala 
AGA CCT GGG GCG 
GC ACC 

Ala 
*** 



Ser Gln Lys Ile
TCC CAG AAA ATT
AAG C G C 

Lys Asn Val 
*** 



Gln Leu Val Asn Thr Asn Gly Ser Trp
J1 262 CAG CTT GTA AAC ACC AAT GGC AGT TGG

HCV1 G A C C

Ile



His Ile Asn Arg Thr Ala Leu Asn Cys
J1 289 CAT ATC AAC AGG ACT GCC CTG AAC TGC
HCV1 C C T C G

Leu Ser 
*** 



Asn Asp Ser Leu Gln Thr Gly Phe Leu
J1 316 AAT GAC TCC CTC CAA ACT GGG TTC CTT
HCV1 TAG AC C CGGTG

Asn Trp 



Ala Ala Leu 
J1 343 GCC GCG CTG

HCV1 AG T

Gly 



FIG. 8-2 
Ser Val Ile

J1 1 C TCA GTG ATC

HCV1 ggctataccggcgacttcga G A

Asp Cys Asn Thr Cys Val Thr Gln Thr
J1 11 GAC TGT AAC ACA TGT GTC ACT CAG ACG

HCV1 C T G C A



Val Asp Phe Ser Leu Asp Pro Thr Phe 
J1 38 GTC GAT TTC AGC TTG GAT CCC ACC TTC

HCV1 C T C T







Thr Ile Glu Thr Thr
J1 65 ACC ATC GAG ACG ACG
HCV1 T A TC
Ile






Asp Ala Val Ser Arg
J1 92 GAT GCG GTT TCG CGC
HCV1 T C C








Gly Arg Thr Gly Arg
J1 119 GGT AGG ACT GGC AGG
HCV1



G C C 



G A CC 
Lys Pro 



Ile Tyr Arg Phe Val Thr Pro Gly Glu
J1 146 ATC TAT AGG TTT GTG ACT CCA GGA GAA

HCV1 C A G A G G G

Ala 





Arg Pro Ser Ala
J1 173 CGG CCC TCG GCG
HCV1 C C GC
Gly



FIG. 9-1 
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Val Leu Cys Glu Cys Tyr Asp Ala Gly 
J1 200 GTC CTA TGT GAG TGT TAT GAC GCG GGC

C C A 



HCV1



Cys Ala Trp Tyr Glu Leu Thr Pro Ala 
J1 227 TGT GCT TGG TAT GAG CTC ACG CCC GCT

HCV1

Glu Thr Ser Val Arg Leu Arg Ala Tyr 
J1 254 GAG ACC TCG GTT AGG TTG CGG GCT TAC

T A A C A A G 

Thr 



HCV1



Leu Asn Thr Pro Gly Leu Pro Val Cys 
J1 281 CTA AAT ACA CCA GGG TTG CCC GTC TGC

AGCCG CT G 



HCV1



HCV1



Met 



Gln Asp His Leu Glu Phe Trp Glu Ser
J1 308 CAG GAC CAT CTG GAG TTC TGG GAG AGC

T A T G 

Gly 



Val Phe Thr Gly Leu Thr His Ile Asp
J1 335 GTC TTC ACA GGC CTC ACC CAC ATA GAC

HCV1 T T T T

Ala His Phe Leu Ser Gln Thr Lys Gln
J1 362 GCC CAC TTC TTG TCC CAG ACT AAG CAG

HCV1 T C A A

Ala Gly Asp Asn Phe Pro Tyr Leu Val 
J1 389 GCA GGA GAC AAC TTC CCC TAC CTG GTA

HCV1 AGT G G C T T

Ser Glu Leu 

FIG. 9-2 






EP 0 419 182 A1 



Ala Tyr Gln Ala Thr Val Cys Ala Arg
416 GCA TAC CAA GCC ACA GTG TGC GCC AGG 
G C T 



Ala Lys Ala Pro Pro Pro Ser Trp Asp 
443 GCT AAG GCT CCA CCT CCA TCG TGG GAT
C A C T C C
Gln



Gln Met Trp Lys Cys Leu Ile Arg Leu
470 CAA ATG TGG AAG TGT CTC ATA CGG CTA
G T G T C C

Lys Pro Thr Leu His Gly Pro Thr Pro 
497 AAG CCT ACG CTG CAC GGG CCA ACG CCC 
C C C T A 

Leu Leu Tyr Arg Leu Gly Ala Val Gin 
524 CTG CTG TAT AGG CTA GGA GCC GTC CAG 
A C A G C T T 

Asn Glu Val Thr Leu Thr His Pro Ile

551 AAT GAG GTC ACC CTC ACA CAC CCT ATA 

A A G G A G C 

Ile Val



Thr Lys 
578 ACC AAA 

tacatcatgacatgcatgtc 

FIG. 9-3 
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Leu Thr 

J1 1 C CTC ACC

HCV1



Arg Asp Pro Thr Val Pro Leu Ala Arg 
CGT GAC CCC ACC GTC CCC CTT GCG CGG 
T A AC C A A 

Thr 
*** 

Ala Ala Trp Glu Thr Ala Arg His Thr 
J1 35 GCT GCG TGG GAG ACA GCT AGA CAC ACT
HCV1 ^



J1 8
HCV1



Pro Val Asn Ser Trp Leu Gly Asn Ile
J1 62 CCA GTC AAC TCC TGG CTA GGC AAC ATC
HCV1 T A



HCV1



Ile Met Tyr Ala Pro Thr Leu Trp Ala
J1 89 ATC ATG TAT GCG CCC ACT TTG TGG GCA

T C AC G 

Phe 



Arg Met Ile Leu Met Thr His Phe Phe
J1 116 AGG ATG ATT CTG ATG ACT CAC TTC TTC
HCV1 A C T T



Ser 

J1 143 TCC
HCV1 AG



Ile Leu Leu Ala
ATC CTT CTA GCC
G A
Val Ile



Gln Glu Gln Leu
CAG GAG CAA CTT 
AG C G 

Arg Asp 
*** 



Glu Lys Ala Leu Asp Cys Gln Ile Tyr
J1 170 GAA AAA GCC CTG GAT TGT CAA ATC TAC
HCV1 C G C CGG

Gln Glu
*** *** 

FIG. 10-1 
Gly Ala Cys Tyr Ser Ile Glu Pro Leu
J1 197 GGG GCC TGT TAC TCC ATT GAG CCA CTT
HCV1 C A A

Asp Leu Pro Gln Ile Ile Glu Arg Leu
J1 224 GAC CTA CCT CAG ATC ATT GAA CGA CTC

Gln
*** *** 



His Gly Leu Ser Ala Phe Ser Leu His
J1 251 CAT GGT CTT AGC GCA TTT TCA CTC CAT
HCV1 C C



Ser Tyr Ser Pro Gly Glu Ile Asn Arg
J1 278 AGT TAC TCT CCA GGT GAG ATC AAT AGG

AT 



HCV1



Val Ala Ser Cys Leu Arg Lys Leu Gly 
J1 305 GTG GCT TCA TGC CTC AGG AAG CTT GGG

C G A A 

Ala 



HCV1



Val Pro Pro Leu Arg Val Trp Arg His 
J1 332 GTA CCA CCC TTG CGA GTC TGG AGA CAT

HCV1 G ^

Ala 

*** 



Arg Ala Arg Ser Val Arg Ala Lys Leu 
J1 359 CGG GCC AGA AGT GTC CGC GCT AAG CTA

C G C G T 

Arg 



HCV1



Leu Ser Gln Gly Gly Arg Ala Ala Thr
J1 386 CTG TCC CAA GGG GGG AGG GCC GCC ACT
HCV1 G AG A C ^

Ala Arg 



*** 



***



FIG. 10-2 
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GCGTCTAGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGG 
CGCAGATCGGTACCGCAATCATACTCACAGCACGTCGGAGGTCC 

ACCCCCCCTCCCGGGAGAGCCATAGTGGTCTGCGGAACCGGTGA 
TGGGGGGGAGGGCCCTCTCGGTATCACCAGACGCCTTGGCCACT



GTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAA 
CATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCTAGTT 



CCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGC 
GGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACG 



TAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
ATCGGCTCATCACAACCCAGCGCTTTCCGGAACACCATGACGGA 

GATAGGGTGCTTGCGAGTGCCCCGGGAGGTCTCGTAGACCGTGC 
CTATCCCACGAACGCTCACGGGGCCCTCCAGAGCATCTGGCACG 



ACC -1 
TGG 

Met Ser Thr Asn Pro Lys Pro Gln Lys Lys Asn
ATG AGC ACG AAT CCT AAA CCT CAA AAA AAA AAC 
TAG TCG TGC TTA GGA TTT GGA GTT TTT TTT TTG 

Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp Val
AAA CGT AAC ACC AAC CGT CGC CCA CAG GAC GTC
TTT GCA TTG TGG TTG GCA GCG GGT GTC CTG CAG 

Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly
AAG TTC CCG GGT GGC GGT CAG ATC GTT GGT GGA
TTC AAG GGC CCA CCG CCA GTC TAG CAA CCA CCT 
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Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu 
100 GTT TAC TTG TTG CCG CGC AGG GGC CCT AGA TTG
CAA ATG AAC AAC GGC GCG TCC CCG GGA TCT AAC 



Gly Val Arg Ala Thr Arg Lys Thr Ser Glu Arg
133 GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG CGG 
CCA CAC GCG CGC TGC TCT TTC TGA AGG CTC GCC 



Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro
166 TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC
AGC GTT GGA GCT CCA TCT GCA GTC GGA TAG GGG 



Lys Ala Arg Arg Pro Glu Gly Arg Thr Trp Ala 
199 AAG GCT CGT CGG CCC GAG GGC AGG ACC TGG GCT 
TTC CGA GCA GCC GGG CTC CCG TCC TGG ACC CGA 



Gln Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn
232 CAG CCC GGG TAC CCT TGG CCC CTC TAT GGC AAT 
GTC GGG CCC ATG GGA ACC GGG GAG ATA CCG TTA 



Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser 
265 GAG GGC TGC GGG TGG GCG GGA TGG CTC CTG TCT 
CTC CCG ACG CCC ACC CGC CCT ACC GAG GAC AGA 



Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr 
298 CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA 
GGG GCA CCG AGA GCC GGA TCG ACC CCG GGG TGT 



Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys 
331 GAC CCC CGG CGT AGG TCG CGC AAT TTG GGT AAG 
CTG GGG GCC GCA TCC AGC GCG TTA AAC CCA TTC 



Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp
364 GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC 
CAG TAG CTA TGG GAA TGC ACG CCG AAG CGG CTG 
Leu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro
397 CTC ATG GGG TAC ATA CCG CTC GTC GGC GCC CCT 
GAG TAC CCC ATG TAT GGC GAG CAG CCG CGG GGA 

Leu Gly Gly Ala Ala Arg Ala Leu Ala His Gly 
430 CTT GGA GGC GCT GCC AGG GCC CTG GCG CAT GGC 
GAA CCT CCG CGA CGG TCC CGG GAC CGC GTA CCG 

Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
463 GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 
CAG GCC CAA GAC CTT CTG CCG CAC TTG ATA CGT 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile
496 ACA GGG AAC CTT CCT GGT TGC TCT TTC TCT ATC 
TGT CCC TTG GAA GGA CCA ACG AGA AAG AGA TAG 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val 
529 TTC CTT CTG GCC CTG CTC TCT TGC TTG ACT GTG 
AAG GAA GAC CGG GAC GAG AGA ACG AAC TGA CAC 

Pro Ala Ser Ala Tyr Gln Val Arg Asn Ser Thr
562 CCC GCT TCG GCC TAC CAA GTG CGC AAC TCC ACG 
GGG CGA AGC CGG ATG GTT CAC GCG TTG AGG TGC 

Gly Leu Tyr His Val Thr Asn Asp Cys Pro Asn 
595 GGG CTT TAC CAC GTC ACC AAT GAT TGC CCT AAC 
CCC GAA ATG GTG CAG TGG TTA CTA ACG GGA TTG 

Ser Ser Ile Val Tyr Glu Ala Ala Asp Ala Ile
628 TCG AGT ATT GTG TAC GAG GCG GCC GAT GCC ATC 
AGC TCA TAA CAC ATG CTC CGC CGG CTA CGG TAG 

Leu His Thr Pro Gly Cys Val Pro Cys Val Arg 
661 CTG CAC ACT CCG GGG TGC GTC CCT TGC GTT CGT 
GAC GTG TGA GGC CCC ACG CAG GGA ACG CAA GCA 
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Glu Gly Asn Ala Ser Arg Cys Trp Val Ala Met 
694 GAG GGC AAC GCC TCG AGG TGT TGG GTG GCG ATG 
CTC CCG TTG CGG AGC TCC ACA ACC CAC CGC TAC 



Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu 
727 ACC CCT ACG GTG GCC ACC AGG GAT GGC AAA CTC 
TGG GGA TGC CAC CGG TGG TCC CTA CCG TTT GAG 

Pro Ala Thr Gln Leu Arg Arg His Ile Asp Leu
760 CCC GCG ACG CAG CTT CGA CGT CAC ATC GAT CTG 
GGG CGC TGC GTC GAA GCT GCA GTG TAG CTA GAC 

Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu
793 CTT GTC GGG AGC GCC ACC CTC TGT TCG GCC CTC 
GAA CAG CCC TCG CGG TGG GAG ACA AGC CGG GAG 

Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu 
826 TAC GTG GGG GAC CTA TGC GGG TCT GTC TTT CTT 
ATG CAC CCC CTG GAT ACG CCC AGA CAG AAA GAA 

Val Gly Gln Leu Phe Thr Phe Ser Pro Arg Arg
859 GTC GGC CAA CTG TTC ACC TTC TCT CCC AGG CGC 
CAG CCG GTT GAC AAG TGG AAG AGA GGG TCC GCG 

His Trp Thr Thr Gln Gly Cys Asn Cys Ser Ile
892 CAC TGG ACG ACG CAA GGT TGC AAT TGC TCT ATC 
GTG ACC TGC TGC GTT CCA ACG TTA ACG AGA TAG 

Tyr Pro Gly His Ile Thr Gly His Arg Met Ala
925 TAT CCC GGC CAT ATA ACG GGT CAC CGC ATG GCA 
ATA GGG CCG GTA TAT TGC CCA GTG GCG TAC CGT 



Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr 
958 TGG GAT ATG ATG ATG AAC TGG TCC CCT ACG ACG 
ACC CTA TAC TAC TAC TTG ACC AGG GGA TGC TGC 
Ala Leu Val Met Ala Gln Leu Leu Arg Ile Pro
991 GCG TTG GTA ATG GCT CAG CTG CTC CGG ATC CCA 
CGC AAC CAT TAC CGA GTC GAC GAG GCC TAG GGT 

Gln Ala Ile Leu Asp Met Ile Ala Gly Ala His
1024 CAA GCC ATC TTG GAC ATG ATC GCT GGT GCT CAC
GTT CGG TAG AAC CTG TAC TAG CGA CCA CGA GTG 

Trp Gly Val Leu Ala Gly Ile Ala Tyr Phe Ser
1057 TGG GGA GTC CTG GCG GGC ATA GCG TAT TTC TCC
ACC CCT CAG GAC CGC CCG TAT CGC ATA AAG AGG 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val
1090 ATG GTG GGG AAC TGG GCG AAG GTC CTG GTA GTG

TAC CAC CCC TTG ACC CGC TTC CAG GAC CAT CAC 

Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr
1123 CTG CTG CTA TTT GCC GGC GTC GAC GCG GAA ACC
GAC GAC GAT AAA CGG CCG CAG CTG CGC CTT TGG

His Val Thr Gly Gly Ser Ala Gly His Thr Val
1156 CAC GTC ACC GGG GGA AGT GCC GGC CAC ACT GTG
GTG CAG TGG CCC CCT TCA CGG CCG GTG TGA CAC

Ser Gly Phe Val Ser Leu Leu Ala Pro Gly Ala
1189 TCT GGA TTT GTT AGC CTC CTC GCA CCA GGC GCC
AGA CCT AAA CAA TCG GAG GAG CGT GGT CCG CGG

Lys Gln Asn Val Gln Leu Ile Asn Thr Asn Gly
1222 AAG CAG AAC GTC CAG CTG ATC AAC ACC AAC GGC
TTC GTC TTG CAG GTC GAC TAG TTG TGG TTG CCG

Ser Trp His Leu Asn Ser Thr Ala Leu Asn Cys 
1255 AGT TGG CAC CTC AAT AGC ACG GCC CTG AAC TGC
TCA ACC GTG GAG TTA TCG TGC CGG GAC TTG ACG 
Asn Asp Ser Leu Asn Thr Gly Trp Leu Ala Gly
1288 AAT GAT AGC CTC AAC ACC GGC TGG TTG GCA GGG
TTA CTA TCG GAG TTG TGG CCG ACC AAC CGT CCC 

Leu Phe Tyr His His Lys Phe Asn Ser Ser Gly
1321 CTT TTC TAT CAC CAC AAG TTC AAC TCT TCA GGC
GAA AAG ATA GTG GTG TTC AAG TTG AGA AGT CCG 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu 
1354 TGT CCT GAG AGG CTA GCC AGC TGC CGA CCC CTT
ACA GGA CTC TCC GAT CGG TCG ACG GCT GGG GAA 

Thr Asp Phe Asp Gln Gly Trp Gly Pro Ile Ser
1387 ACC GAT TTT GAC CAG GGC TGG GGC CCT ATC AGT
TGG CTA AAA CTG GTC CCG ACC CCG GGA TAG TCA 

Tyr Ala Asn Gly Ser Gly Pro Asp Gin Arg Pro 
1420 TAT GCC AAC GGA AGC GGC CCC GAC CAG CGC CCC
ATA CGG TTG CCT TCG CCG GGG CTG GTC GCG GGG 

Tyr Cys Trp His Tyr Pro Pro Lys Pro Cys Gly
1453 TAC TGC TGG CAC TAC CCC CCA AAA CCT TGC GGT 
ATG ACG ACC GTG ATG GGG GGT TTT GGA ACG CCA 

Ile Val Pro Ala Lys Ser Val Cys Gly Pro Val
1486 ATT GTG CCC GCG AAG AGT GTG TGT GGT CCG GTA 
TAA CAC GGG CCC TTC TCA CAC ACA CCA GGC CAT 

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly
1519 TAT TGC TTC ACT CCC AGC CCC GTG GTG GTG GGA 
ATA ACG AAG TGA GGG TCG GGG CAC CAC CAC CCT 

Thr Thr Asp Arg Ser Gly Ala Pro Thr Tyr Ser

1552 ACG ACC GAC AGG TCG GGC GCG CCC ACC TAC AGC 
TGC TGG CTG TCC AGC CCG CGC GGG TGG ATG TCG 
Trp Gly Glu Asn Asp Thr Asp Val Phe Val Leu
1585 TGG GGT GAA AAT GAT ACG GAC GTC TTC GTC CTT 
ACC CCA CTT TTA CTA TGC CTG CAG AAG CAG GAA 

Asn Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe 
1618 AAC AAT ACC AGG CCA CCG CTG GGC AAT TGG TTC
TTG TTA TGG TCC GGT GGC GAC CCG TTA ACC AAG 

Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr
1651 GGT TGT ACC TGG ATG AAC TCA ACT GGA TTC ACC
CCA ACA TGG ACC TAC TTG ACT TGA CCT AAG TGG

Lys Val Cys Gly Ala Pro Pro Cys Val Ile Gly
1684 AAA GTG TGC GGA GCG CCT CCT TGT GTC ATC GGA 
TTT CAC ACG CCT CGC GGA GGA ACA CAG TAG CCT 

Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr 
1717 GGG GCG GGC AAC AAC ACC CTG CAC TGC CCC ACT 
CCC CGC CCG TTG TTG TGG GAC GTG ACG GGG TGA 

Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr
1750 GAT TGC TTC CGC AAG CAT CCG GAC GCC ACA TAC
CTA ACG AAG GCG TTC GTA GGC CTG CGG TGT ATG 

Ser Arg Cys Gly Ser Gly Pro Trp Ile Thr Pro
1783 TCT CGG TGC GGC TCC GGT CCC TGG ATC ACA CCC
AGA GCC ACG CCG AGG CCA GGG ACC TAG TGT GGG



Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp
1816 AGG TGC CTG GTC GAC TAC CCG TAT AGG CTT TGG
TCC ACG GAC CAG CTG ATG GGC ATA TCC GAA ACC



His Tyr Pro Cys Thr Ile Asn Tyr Thr Ile Phe
1849 CAT TAT CCT TGT ACC ATC AAC TAC ACC ATA TTT 
GTA ATA GGA ACA TGG TAG TTG ATG TGG TAT AAA 
Lys Ile Arg Met Tyr Val Gly Gly Val Glu His
1882 AAA ATC AGG ATG TAC GTG GGA GGG GTC GAA CAC
TTT TAG TCC TAC ATG CAC CCT CCC CAG CTT GTG 



Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly 
1915 AGG CTG GAA GCT GCC TGC AAC TGG ACG CGG GGC 
TCC GAC CTT CGA CGG ACG TTG ACC TGC GCC CCG



Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser 
1948 GAA CGT TGC GAT CTG GAA GAC AGG GAC AGG TCC 
CTT GCA ACG CTA GAC CTT CTG TCC CTG TCC AGG 



Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr Gln
1981 GAG CTC AGC CCG TTA CTG CTG ACC ACT ACA CAG 
CTC GAG TCG GGC AAT GAC GAC TGG TGA TGT GTC 



Trp Gln Val Leu Pro Cys Ser Phe Thr Thr Leu
2014 TGG CAG GTC CTC CCG TGT TCC TTC ACA ACC CTA 
ACC GTC CAG GAG GGC ACA AGG AAG TGT TGG GAT 



Pro Ala Leu Ser Thr Gly Leu Ile His Leu His
2047 CCA GCC TTG TCC ACC GGC CTC ATC CAC CTC CAC 
GGT CGG AAC AGG TGG CCG GAG TAG GTG GAG GTG 



Gln Asn Ile Val Asp Val Gln Tyr Leu Tyr Gly
2080 CAG AAC ATT GTG GAC GTG CAG TAC TTG TAC GGG 
GTC TTG TAA CAC CTG CAC GTC ATG AAC ATG CCC 



Val Gly Ser Ser Ile Ala Ser Trp Ala Ile Lys
2113 GTG GGG TCA AGC ATC GCG TCC TGG GCC ATT AAG 
CAC CCC AGT TCG TAG CGC AGG ACC CGG TAA TTC 



Trp Glu Tyr Val Val Leu Leu Phe Leu Leu Leu 
2146 TGG GAG TAC GTC GTT CTC CTG TTC CTT CTG CTT 
ACC CTC ATG CAG CAA GAG GAC AAG GAA GAC GAA 
Ala Asp Ala Arg Val Cys Ser Cys Leu Trp Met
2179 GCA GAC GCG CGC GTC TGC TCC TGC TTG TGG ATG
CGT CTG CGC GCG CAG ACG AGG ACG AAC ACC TAC

Met Leu Leu Ile Ser Gln Ala Glu Ala Ala Leu
^oio aqir PTA CTC ATA TCC CAA GCG GAG GCG GCT iHa 
TAC St GAG TAT AGG GTT CGC CTC CGC CGA AAC 

Glu Asn Leu Val Ile Leu Asn Ala Ala Ser Leu
2245 GAG AAC CTC GTA ATA CTT AAT GCA GCA TCC CTG

CTC TTG GAG CAT TAT GAA TTA CGT CGT AGG GAC 

Ala Gly Thr His Gly Leu Val Ser Phe Leu Val
2278 GCC GGG ACG CAC GGT CTT GTA TCC TTC CTC GTG
CGG CCC TGC GTG CCA GAA CAT AGG AAG GAG CAC



Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly Lys
2311 TTC TTC TGC TTT GCA TGG TAT TTG AAG GGT AAG
AAG AAG ACG AAA CGT ACC ATA AAC TTC CCA TTC

Trp Val Pro Gly Ala Val Tyr Thr Phe Tyr Gly
2344 TGG GTG CCC GGA GCG GTC TAC ACC TTC TAC GGG
ACC CAC GGG CCT CGC CAG ATG TGG AAG ATG CCC 

Met Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu
2377 ATG TGG CCT CTC CTC CTG CTC CTG TTG GCG TTG
TAC ACC GGA GAG GAG GAC GAG GAC AAC CGC AAC

Pro Gln Arg Ala Tyr Ala Leu Asp Thr Glu Val
2410 CCC CAG CGG GCG TAC GCG CTG GAC ACG GAG GTG
GGG GTC GCC CGC ATG CGC GAC CTG TGC CTC CAC

Ala Ala Ser Cys Gly Gly Val Val Leu Val Gly

2443 GCC GCG TCG TGT GGC GGT GTT GTT CTC GTC GGG

CGG CGC AGC ACA CCG CCA CAA CAA GAG CAG CCC 
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Leu Met Ala Leu Thr Leu Ser Pro Tyr Tyr Lys 
2476 TTG ATG GCG CTG ACT CTG TCA CCA TAT TAC AAG 
AAC TAC CGC GAC TGA GAC AGT GGT ATA ATG TTC 

Arg Tyr Ile Ser Trp Cys Leu Trp Trp Leu Gln
2509 CGC TAT ATC AGC TGG TGC TTG TGG TGG CTT CAG 
GCG ATA TAG TCG ACC ACG AAC ACC ACC GAA GTC 

Tyr Phe Leu Thr Arg Val Glu Ala Gln Leu His
2542 TAT TTT CTG ACC AGA GTG GAA GCG CAA CTG CAC 
ATA AAA GAC TGG TCT CAC CTT CGC GTT GAC GTG 



Val Trp Ile Pro Pro Leu Asn Val Arg Gly Gly
2575 GTG TGG ATT CCC CCC CTC AAC GTC CGA GGG GGG
CAC ACC TAA GGG GGG GAG TTG CAG GCT CCC CCC



Arg Asp Ala Val Ile Leu Leu Met Cys Ala Val
2608 CGC GAC GCC GTC ATC TTA CTC ATG TGT GCT GTA 
GCG CTG CGG CAG TAG AAT GAG TAC ACA CGA CAT 

His Pro Thr Leu Val Phe Asp Ile Thr Lys Leu
2641 CAC CCG ACT CTG GTA TTT GAC ATC ACC AAA TTG 
GTG GGC TGA GAC CAT AAA CTG TAG TGG TTT AAC 



Leu Leu Ala Val Phe Gly Pro Leu Trp Ile Leu
2674 CTG CTG GCC GTC TTC GGA CCC CTT TGG ATT CTT 
GAC GAC CGG CAG AAG CCT GGG GAA ACC TAA GAA 



Gln Ala Ser Leu Leu Lys Val Pro Tyr Phe Val
2707 CAA GCC AGT TTG CTT AAA GTA CCC TAC TTT GTG
GTT CGG TCA AAC GAA TTT CAT GGG ATG AAA CAC



Arg Val Gln Gly Leu Leu Arg
2740 CGC GTC CAA GGC CTT CTC CGG
GCG CAG GTT CCG GAA GAG GCC
Ala Arg Lys Met Ile Gly Gly His Tyr Val Gln
2773 GCG CGG AAG ATG ATC GGA GGC CAT TAC GTG CAA
CGC GCC TTC TAC TAG CCT CCG GTA ATG CAC GTT



Met Val Ile Ile Lys Leu Gly Ala Leu Thr Gly
2806 ATG GTC ATC ATT AAG TTA GGG GCG CTT ACT GGC
TAC CAG TAG TAA TTC AAT CCC CGC GAA TGA CCG



Thr Tyr Val Tyr Asn His Leu Thr Pro Leu Arg
2839 ACC TAT GTT TAT AAC CAT CTC ACT CCT CTT CGG
TGG ATA CAA ATA TTG GTA GAG TGA GGA GAA GCC

Asp Trp Ala His Asn Gly Leu Arg Asp Leu Ala
2872 GAC TGG GCG CAC AAC GGC TTG CGA GAT CTG GCC
CTG ACC CGC GTG TTG CCG AAC GCT CTA GAC CGG 



Val Ala Val Glu Pro Val Val Phe Ser Gln Met
2905 GTG GCT GTA GAG CCA GTC GTC TTC TCC CAA ATG
CAC CGA CAT CTC GGT CAG CAG AAG AGG GTT TAC



Glu Thr Lys Leu Ile Thr Trp Gly Ala Asp Thr
2938 GAG ACC AAG CTC ATC ACG TGG GGG GCA GAT ACC 
CTC TGG TTC GAG TAG TGC ACC CCC CGT CTA TGG 



Ala Ala Cys Gly Asp Ile Ile Asn Gly Leu Pro
2971 GCC GCG TGC GGT GAC ATC ATC AAC GGC TTG CCT
CGG CGC ACG CCA CTG TAG TAG TTG CCG AAC GGA



Val Ser Ala Arg Arg Gly Arg Glu Ile Leu Leu
3004 GTT TCC GCC CGC AGG GGC CGG GAG ATA CTG CTC
CAA AGG CGG GCG TCC CCG GCC CTC TAT GAC GAG



Gly Pro Ala Asp Gly Met Val
3037 GGG CCA GCC GAT GGA ATG GTC
CCC GGT CGG CTA CCT TAC CAG
Arg Leu Leu Ala Pro Ile Thr Ala Tyr Ala Gln
3070 AGG TTG CTG GCG CCC ATC ACG GCG TAC GCC CAG
TCC AAC GAC CGC GGG TAG TGC CGC ATG CGG GTC

Gln Thr Arg Gly Leu Leu Gly Cys Ile Ile Thr
3103 CAG ACA AGG GGC CTC CTA GGG TGC ATA ATC ACC 
GTC TGT TCC CCG GAG GAT CCC ACG TAT TAG TGG 

Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu
3136 AGC CTA ACT GGC CGG GAC AAA AAC CAA GTG GAG 
TCG GAT TGA CCG GCC CTG TTT TTG GTT CAC CTC 

Gly Glu Val Gln Ile Val Ser Thr Ala Ala Gln
3169 GGT GAG GTC CAG ATT GTG TCA ACT GCT GCC CAA
CCA CTC CAG GTC TAA CAC AGT TGA CGA CGG GTT 

Thr Phe Leu Ala Thr Cys Ile Asn Gly Val Cys
3202 ACC TTC CTG GCA ACG TGC ATC AAT GGG GTG TGC 
TGG AAG GAC CGT TGC ACG TAG TTA CCC CAC ACG 

Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 
3235 TGG ACT GTC TAC CAC GGG GCC GGA ACG AGG ACC 
ACC TGA CAG ATG GTG CCC CGG CCT TGC TCC TGG 

Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met
3268 ATC GCG TCA CCC AAG GGT CCT GTC ATC CAG ATG 
TAG CGC AGT GGG TTC CCA GGA CAG TAG GTC TAC 

Tyr Thr Asn Val Asp Gln Asp Leu Val Gly Trp
3301 TAT ACC AAT GTA GAC CAA GAC CTT GTG GGC TGG 
ATA TGG TTA CAT CTG GTT CTG GAA CAC CCG ACC 

Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr Pro
3334 CCC GCT CCG CAA GGT AGC CGC TCA TTG ACA CCC 
GGG CGA GGC GTT CCA TCG GCG AGT AAC TGT GGG 
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Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val 
3367 TGC ACT TGC GGC TCC TCG GAC CTT TAC CTG GTC
ACG TGA ACG CCG AGG AGC CTG GAA ATG GAC CAG 

Thr Arg His Ala Asp Val Ile Pro Val Arg Arg
3400 ACG AGG CAC GCC GAT GTC ATT CCC GTG CGC CGG 
TGC TCC GTG CGG CTA CAG TAA GGG CAC GCG GCC 

Arg Gly Asp Ser Arg Gly Ser Leu Leu Ser Pro
3433 CGG GGT GAT AGC AGG GGC AGC CTG CTG TCG CCC 
GCC CCA CTA TCG TCC CCG TCG GAC GAC AGC GGG 



Arg Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly
3466 CGG CCC ATT TCC TAC TTG AAA GGC TCC TCG GGG 
GCC GGG TAA AGG ATG AAC TTT CCG AGG AGC CCC 



Gly Pro Leu Leu Cys Pro Ala Gly His Ala Val 
3499 GGT CCG CTG TTG TGC CCC GCG GGG CAC GCC GTG 
CCA GGC GAC AAC ACG GGG CGC CCC GTG CGG CAC 

Gly Ile Phe Arg Ala Ala Val Cys Thr Arg Gly
3532 GGC ATA TTT AGG GCC GCG GTG TGC ACC CGT GGA 
CCG TAT AAA TCC CGG CGC CAC ACG TGG GCA CCT 

Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu
3565 GTG GCT AAG GCG GTG GAC TTT ATC CCT GTG GAG 
CAC CGA TTC CGC CAC CTG AAA TAG GGA CAC CTC 



Asn Leu Glu Thr Thr Met Arg Ser Pro Val Phe 
3598 AAC CTA GAG ACA ACC ATG AGG TCC CCG GTG TTC 
TTG GAT CTC TGT TGG TAC TCC AGG GGC CAC AAG 



Thr Asp Asn Ser Ser Pro Pro Val Val Pro Gln
3631 ACG GAT AAC TCC TCT CCA CCA GTA GTG CCC CAG 
TGC CTA TTG AGG AGA GGT GGT CAT CAC GGG GTC 
Ser Phe Gln Val Ala His Leu His Ala Pro Thr
3664 AGC TTC CAG GTG GCT CAC CTC CAT GCT CCC ACA
TCG AAG GTC CAC CGA GTG GAG GTA CGA GGG TGT 



3697 



3730 



Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala
ill GGC AAA AGC ACC AAG GTC CCG GCT GCA 
TCG CCG TTT TCG TGG TTC CAG GGC CGA CGT 

Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu

TAT GCA GCT CAG GGC TAT AAG GTG CTA GTA CTC
ATA CGT CGA GTC CCG ATA TTC CAC GAT CAT GAG



ie« Pi-o Ser Val Ala Ala Thr Leu Gly Phe Gly 
III TCT GTT GCT GCA ACA CTG GGC TTT GGT 
^G Sg CGA CGT TGT GAC CCG AAA CCA 

Ala Tyr Met Ser Lys Ala His Gly Ile Asp Pro
3796 GCT TAC ATG TCC AAG GCT CAT GGG ATC GAT CCT
CGA ATG TAC AGG TTC CGA GTA CCC TAG CTA GGA

Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr
3829 AAC ATC AGG ACC GGG GTG AGA ACA ATT ACC ACT

TTG TAG TCC TGG CCC CAC TCT TGT TAA TGG TGA 



Gly Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys
3862 GGC AGC CCC ATC ACG TAC TCC ACC TAC GGC AAG
CCG TCG GGG TAG TGC ATG AGG TGG ATG CCG TTC 



Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala 

3895 TTC CTT GCC GAC GGC GGG TGC TCG GGG GGC GCT
AAG GAA CGG CTG CCG CCC ACG AGC CCC CCG CGA



Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser
3928 TAT GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC
ATA CTG TAT TAT TAA ACA CTG CTC ACG GTG AGG 
Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr
3961 ACG GAT GCC ACA TCC ATC TTG GGC ATC GGC ACT
TGC CTA CGG TGT AGG TAG AAC CCG TAG CCG TGA

Val Leu Asp Gln Ala Glu Thr Ala Gly Ala Arg
3994 GTC CTT GAC CAA GCA GAG ACT GCG GGG GCG AGA
CAG GAA CTG GTT CGT CTC TGA CGC CCC CGC TCT

Leu Val Val Leu Ala Thr Ala Thr Pro Pro Gly
4027 CTG GTT GTG CTC GCC ACC GCC ACC CCT CCG GGG
GAC CAA CAC GAG CGG TGG CGG TGG GGA GGC CCG

Ser Val Thr Val Pro His Pro Asn Ile Glu Glu
4060 TCC GTC ACT GTG CCC CAT CCC AAC ATC GAG GAG
AGG CAG TGA CAC GGG GTA GGG TTG TAG CTC CTC

val Ala Leu Ser Thr Thr Gly Glu lie Pro ^ 

Ana-x rTT GCT CTG TCC ACC ACC GGA GAG ATC CCT Xii 
4093 GTT GCT CTG TC^ ^ 

Tyr Gly Lys Ala Ile Pro Leu Glu Val Ile Lys
4126 TAC GGC AAG GCT ATC CCC CTC GAA GTA ATC AAG
ATG CCG TTC CGA TAG GGG GAG CTT CAT TAG TTC 

Gly Gly Arg His Leu Ile Phe Cys His Ser Lys
4159 GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG
CCC CCC TCT GTA GAG TAG AAG ACA GTA AGT TTC

Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Val
4192 AAG AAG TGC GAC GAA CTC GCC GCA AAG CTG GTC
TTC TTC ACG CTG CTT GAG CGG CGT TTC GAC CAG

Ala Leu Gly Ile Asn Ala Val Ala Tyr Tyr Arg
4225 GCA TTG GGC ATC AAT GCC GTG GCC TAC TAC CGC
CGT AAC CCG TAG TTA CGG CAC CGG ATG ATG GCG 
Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly
4258 GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC 
CCA GAA CTG CAC AGG CAG TAG GGC TGG TCG CCG 

Asp Val Val Val Val Ala Thr Asp Ala Leu Met
4291 GAT GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG 
CTA CAA CAG CAG CAC CGT TGG CTA CGG GAG TAC 

Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val Ile
4324 ACC GGC TAT ACC GGC GAC TTC GAC TCG GTG ATA 
TGG CCG ATA TGG CCG CTG AAG CTG AGC CAC TAT 

Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp
4357 GAC TGC AAT ACG TGT GTC ACC CAG ACA GTC GAT
CTG ACG TTA TGC ACA CAG TGG GTC TGT CAG CTA

Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr
4390 TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA 
AAG TCG GAA CTG GGA TGG AAG TGG TAA CTC TGT 

Ile Thr Leu Pro Gln Asp Ala Val Ser Arg Thr
4423 ATC ACG CTC CCC CAG GAT GCT GTC TCC CGC ACT 
TAG TGC GAG GGG GTC CTA CGA CAG AGG GCG TGA 



Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro
4456 CAA CGT CGG GGC AGG ACT GGC AGG GGG AAG CCA 
GTT GCA GCC CCG TCC TGA CCG TCC CCC TTC GGT 

Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg
4489 GGC ATC TAC AGA TTT GTG GCA CCG GGG GAG CGC 
CCG TAG ATG TCT AAA CAC CGT GGC CCC CTC GCG 

Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys
4522 CCC TCC GGC ATG TTC GAC TCG TCC GTC CTC TGT 
GGG AGG CCG TAC AAG CTG AGC AGG CAG GAG ACA 
Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr Glu
4555 GAG TGC TAT GAC GCA GGC TGT GCT TGG TAT GAG
CTC ACG ATA CTG CGT CCG ACA CGA ACC ATA CTC 

Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg 
4588 CTC ACG CCC GCC GAG ACT ACA GTT AGG CTA CGA
GAG TGC GGG CGG CTC TGA TGT CAA TCC GAT GCT

Ala Tyr Met Asn Thr Pro Gly Leu Pro Val Cys 
4621 GCG TAC ATG AAC ACC CCG GGG CTT CCC GTG TGC
CGC ATG TAC TTG TGG GGC CCC GAA GGG CAC ACG



Gln Asp His Leu Glu Phe Trp Glu Gly Val Phe
4654 CAG GAC CAT CTT GAA TTT TGG GAG GGC GTC TTT
GTC CTG GTA GAA CTT AAA ACC CTC CCG CAG AAA



Thr Gly Leu Thr His Ile Asp Ala His Phe Leu
4687 ACA GGC CTC ACT CAT ATA GAT GCC CAC TTT CTA
TGT CCG GAG TGA GTA TAT CTA CGG GTG AAA GAT



Ser Gln Thr Lys Gln Ser Gly Glu Asn Leu Pro
4720 TCC CAG ACA AAG CAG AGT GGG GAG AAC CTT CCT
AGG GTC TGT TTC GTC TCA CCC CTC TTG GAA GGA



Tyr Leu Val Ala Tyr Gln Ala Thr Val Cys Ala
4753 TAC CTG GTA GCG TAC CAA GCC ACC GTG TGC GCT
ATG GAC CAT CGC ATG GTT CGG TGG CAC ACG CGA



Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln
4786 AGG GCT CAA GCC CCT CCC CCA TCG TGG GAC CAG
TCC CGA GTT CGG GGA GGG GGT AGC ACC CTG GTC 

Met Trp Lys Cys Leu Ile Arg Leu Lys Pro Thr
4819 ATG TGG AAG TGT TTG ATT CGC CTC AAG CCC ACC
TAC ACC TTC ACA AAC TAA GCG GAG TTC GGG TGG
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Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
4852 CTC CAT GGG CCA ACA CCC CTG CTA TAC AGA CTG
GAG GTA CCC GGT TGT GGG GAC GAT ATG TCT GAC 

Gly Ala Val Gln Asn Glu Ile Thr Leu Thr His
4885 GGC GCT GTT CAG AAT GAA ATC ACC CTG ACG CAC 
CCG CGA CAA GTC TTA CTT TAG TGG GAC TGC GTG 

Pro Val Thr Lys Tyr Ile Met Thr Cys Met Ser
4918 CCA GTC ACC AAA TAC ATC ATG ACA TGC ATG TCG 
GGT CAG TGG TTT ATG TAG TAC TGT ACG TAC AGC 

Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val
4951 GCC GAC CTG GAG GTC GTC ACG AGC ACC TGG GTG 
CGG CTG GAC CTC CAG CAG TGC TCG TGG ACC CAC 

Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
4984 CTC GTT GGC GGC GTC CTG GCT GCT TTG GCC GCG 
GAG CAA CCG CCG CAG GAC CGA CGA AAC CGG CGC

Tyr Cys Leu Ser Thr Gly Cys Val Val Ile Val
5017 TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG 
ATA ACG GAC AGT TGT CCG ACG CAC CAG TAT CAC 



Gly Arg Val Val Leu Ser Gly Lys Pro Ala Ile
5050 GGC AGG GTC GTC TTG TCC GGG AAG CCG GCA ATC 
CCG TCC CAG CAG AAC AGG CCC TTC GGC CGT TAG 

Ile Pro Asp Arg Glu Val Leu Tyr Arg Glu Phe
5083 ATA CCT GAC AGG GAA GTC CTC TAC CGA GAG TTC 
TAT GGA CTG TCC CTT CAG GAG ATG GCT CTC AAG 

Asp Glu Met Glu Glu Cys Ser Gln His Leu Pro
5116 GAT GAG ATG GAA GAG TGC TCT CAG CAC TTA CCG
CTA CTC TAC CTT CTC ACG AGA GTC GTG AAT GGC
Tyr Ile Glu Gln Gly Met Met Leu Ala Glu Gln
5149 TAC ATC GAG CAA GGG ATG ATG CTC GCC GAG CAG
ATG TAG CTC GTT CCC TAG TAC GAG CGG CTC GTC 

Phe Lys Gln Lys Ala Leu Gly Leu Leu Gln Thr
5182 TTC AAG CAG AAG GCC CTC GGC CTC CTG CAG ACC
AAG TTC GTC TTC CGG GAG CCG GAG GAC GTC TGG

Ala Ser Arg Gln Ala Glu Val Ile Ala Pro Ala
5215 GCG TCC CGT CAG GCA GAG GTT ATC GCC CCT GCT
CGC AGG GCA GTC CGT CTC CAA TAG CGG GGA CGA

Val Gln Thr Asn Trp Gln Lys Leu Glu Thr Phe
5248 GTC CAG ACC AAC TGG CAA AAA CTC GAG ACC TTC 
CAG GTC TGG TTG ACC GTT TTT GAG CTC TGG AAG 

Trp Ala Lys His Met Trp Asn Phe Ile Ser Gly
5281 TGG GCG AAG CAT ATG TGG AAC TTC ATC AGT GGG 
ACC CGC TTC GTA TAC ACC TTG AAG TAG TCA CCC 

Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro
5314 ATA CAA TAC TTG GCG GGC TTG TCA ACG CTG CCT 
TAT GTT ATG AAC CGC CCG AAC AGT TGC GAC GGA 

Gly Asn Pro Ala Ile Ala Ser Leu Met Ala Phe
5347 GGT AAC CCC GCC ATT GCT TCA TTG ATG GCT TTT 
CCA TTG GGG CGG TAA CGA AGT AAC TAC CGA AAA 



Thr Ala Ala Val Thr Ser Pro Leu Thr Thr Ser 
5380 ACA GCT GCT GTC ACC AGC CCA CTA ACC ACT AGC 
TGT CGA CGA CAG TGG TCG GGT GAT TGG TGA TCG 

Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp
5413 CAA ACC CTC CTC TTC AAC ATA TTG GGG GGG TGG
GTT TGG GAG GAG AAG TTG TAT AAC CCC CCC ACC
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Val Ala Ala Gin Leu Ala Ala Pro Gly Ala Ala 
5446 GTG GCT GCC CAG CTC GCC GCC CCC GGT GCC GCT 
CAC CGA CGG GTC GAG CGG CGG GGG CCA CGG CGA 

Thr Ala Phe Val Gly Ala Gly Leu Ala Gly Ala 
5479 ACT GCC TTT GTG GGC GCT GGC TTA GCT GGC GCC 
TGA CGG AAA CAC CCG CGA CCG AAT CGA CCG CGG 



Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu
5512 GCC ATC GGC AGT GTT GGA CTG GGG AAG GTC CTC 
CGG TAG CCG TCA CAA CCT GAC CCC TTC CAG GAG 

Ile Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val
5545 ATA GAC ATC CTT GCA GGG TAT GGC GCG GGC GTG 
TAT CTG TAG GAA CGT CCC ATA CCG CGC CCG CAC 

Ala Gly Ala Leu Val Ala Phe Lys Ile Met Ser
5578 GCG GGA GCT CTT GTG GCA TTC AAG ATC ATG AGC 
CGC CCT CGA GAA CAC CGT AAG TTC TAG TAC TCG 

Gly Glu Val Pro Ser Thr Glu Asp Leu Val Asn 
5611 GGT GAG GTC CCC TCC ACG GAG GAC CTG GTC AAT 
CCA CTC CAG GGG AGG TGC CTC CTG GAC CAG TTA 



Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu
5644 CTA CTG CCC GCC ATC CTC TCG CCC GGA GCC CTC 
GAT GAC GGG CGG TAG GAG AGC GGG CCT CGG GAG 



Val Val Gly Val Val Cys Ala Ala Ile Leu Arg
5677 GTA GTC GGC GTG GTC TGT GCA GCA ATA CTG CGC 
CAT CAG CCG CAC CAG ACA CGT CGT TAT GAC GCG 



Arg His Val Gly Pro Gly Glu Gly Ala Val Gln
5710 CGG CAC GTT GGC CCG GGC GAG GGG GCA GTG CAG 
GCC GTG CAA CCG GGC CCG CTC CCC CGT CAC GTC 
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Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg
5743 TGG ATG AAC CGG CTG ATA GCC TTC GCC TCC CGG
ACC TAC TTG GCC GAC TAT CGG AAG CGG AGG GCC

Gly Asn His Val Ser Pro Thr His Tyr Val Pro
5776 GGG AAC CAT GTT TCC CCC ACG CAC TAC GTG CCG
CCC TTG GTA CAA AGG GGG TGC GTG ATG CAC GGC

Glu Ser Asp Ala Ala Ala Arg Val Thr Ala Ile
5809 GAG AGC GAT GCA GCT GCC CGC GTC ACT GCC ATA
CTC TCG CTA CGT CGA CGG GCG CAG TGA CGG TAT 

Leu Ser Ser Leu Thr Val Thr Gln Leu Leu Arg
5842 CTC AGC AGC CTC ACT GTA ACC CAG CTC CTG AGG
GAG TCG TCG GAG TGA CAT TGG GTC GAG GAC TCC 

Arg Leu His Gln Trp Ile Ser Ser Glu Cys Thr
5875 CGA CTG CAC CAG TGG ATA AGC TCG GAG TGT ACC
GCT GAC GTG GTC ACC TAT TCG AGC CTC ACA TGG

Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Ile
5908 ACT CCA TGC TCC GGT TCC TGG CTA AGG GAC ATC 
TGA GGT ACG AGG CCA AGG ACC GAT TCC CTG TAG 

Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe
5941 TGG GAC TGG ATA TGC GAG GTG TTG AGC GAC TTT
ACC CTG ACC TAT ACG CTC CAC AAC TCG CTG AAA 

Lys Thr Trp Leu Lys Ala Lys Leu Met Pro Gln
.=«^A tlr Inl TGG CTA AAA GCT AAG CTC ATG CCA CAG 
TGG ACC gIt ^ CGA TTC GAG TAC GGT GTC 

Leu Pro Gly Ile Pro Phe Val Ser Cys Gln Arg
6007 CTG CCT GGG ATC CCC TTT GTG TCC TGC CAG CGC
GAC GGA CCC TAG GGG AAA CAC AGG ACG GTC GCG 
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Gly Tyr Lys Gly Val Trp Arg Val Asp Gly Ile
6040 GGG TAT AAG GGG GTC TGG CGA GTG GAC GGC ATC
CCC ATA TTC CCC CAG ACC GCT CAC CTG CCG TAG

Met His Thr Arg Cys His Cys Gly Ala Glu Ile
6073 ATG CAC ACT CGC TGC CAC TGT GGA GCT GAG ATC
TAG GTG TGA GCG ACG GTG ACA CCT CGA CTC TAG

Thr Gly His Val Lys Asn Gly Thr Met Arg Ile
6106 ACT GGA CAT GTC AAA AAC GGG ACG ATG AGG ATC
TGA CCT GTA CAG TTT TTG CCC TGC TAC TCC TAG

Val Gly Pro Arg Thr Cys Arg Asn Met Trp Ser
6139 GTC GGT CCT AGG ACC TGC AGG AAC ATG TGG AGT
CAG CCA GGA TCC TGG ACG TCC TTG TAC ACC TCA

Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly
6172 GGG ACC TTC CCC ATT AAT GCC TAC ACC ACG GGC
CCC TGG AAG GGG TAA TTA CGG ATG TGG TGC CCG

Pro Cys Thr Pro Leu Pro Ala Pro Asn Tyr Thr
6205 CCC TGT ACC CCC CTT CCT GCG CCG AAC TAC ACG
GGG ACA TGG GGG GAA GGA CGC GGC TTG ATG TGC

Phe Ala Leu Trp Arg Val Ser Ala Glu Glu Tyr
6238 TTC GCG CTA TGG AGG GTG TCT GCA GAG GAA TAT
AAG CGC GAT ACC TCC CAC AGA CGT CTC CTT ATA

Val Glu Ile Arg Gln Val Gly Asp Phe His Tyr
6271 GTG GAG ATA AGG CAG GTG GGG GAC TTC CAC TAC
CAC CTC TAT TCC GTC CAC CCC CTG AAG GTG ATG

Val Thr Gly Met Thr Thr Asp Asn Leu Lys Cys
6304 GTG ACG GGT ATG ACT ACT GAC AAT CTC AAA TGC
CAC TGC CCA TAC TGA TGA CTG TTA GAG TTT ACG
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Pro Cys Gln Val Pro Ser Pro Glu Phe Phe Thr
6337 CCG TGC CAG GTC CCA TCG CCC GAA TTT TTC ACA 
GGC ACG GTC CAG GGT AGC GGG CTT AAA AAG TGT 

Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala
6370 GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG 
CTT AAC CTG CCC CAC GCG GAT GTA TCC AAA CGC 



Pro Pro Cys Lys Pro Leu Leu Arg Glu Glu Val
6403 CCC CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA
GGG GGG ACG TTC GGG AAC GAC GCC CTC CTC CAT



Ser Phe Arg Val Gly Leu His Glu Tyr Pro Val
6436 TCA TTC AGA GTA GGA CTC CAC GAA TAC CCG GTA
AGT AAG TCT CAT CCT GAG GTG CTT ATG GGC CAT



Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp
6469 GGG TCG CAA TTA CCT TGC GAG CCC GAA CCG GAC 
CCC AGC GTT AAT GGA ACG CTC GGG CTT GGC CTG 

Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro
6502 GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC 
CAC CGG CAC AAC TGC AGG TAG GAG TGA CTA GGG 



Ser His Ile Thr Ala Glu Ala Ala Gly Arg Arg
6535 TCC CAT ATA ACA GCA GAG GCG GCC GGG CGA AGG
AGG GTA TAT TGT CGT CTC CGC CGG CCC GCT TCC



Leu Ala Arg Gly Ser Pro Pro Ser Val Ala Ser
6568 TTG GCG AGG GGA TCA CCC CCC TCT GTG GCC AGC
AAC CGC TCC CCT AGT GGG GGG AGA CAC CGG TCG



Ser Ser Ala Ser Gln Leu Ser
6601 TCC TCG GCT AGC CAG CTA TCC
AGG AGC CGA TCG GTC GAT AGG
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Lys Ala Thr Cys Thr Ala Asn His Asp Ser Pro
6634 AAG GCA ACT TGC ACC GCT AAC CAT GAC TCC CCT
TTC CGT TGA ACG TGG CGA TTG GTA CTG AGG GGA

Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp
6667 GAT GCT GAG CTC ATA GAG GCC AAC CTC CTA TGG 
CTA CGA CTC GAG TAT CTC CGG TTG GAG GAT ACC 



Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val
6700 AGG CAG GAG ATG GGC GGC AAC ATC ACC AGG GTT
TCC GTC CTC TAC CCG CCG TTG TAG TGG TCC CAA 

Glu Ser Glu Asn Lys Val Val Ile Leu Asp Ser
6733 GAG TCA GAA AAC AAA GTG GTG ATT CTG GAC TCC 
CTC AGT CTT TTG TTT CAC CAC TAA GAC CTG AGG 



Phe Asp Pro Leu Val Ala Glu Glu Asp Glu Arg 
6766 TTC GAT CCG CTT GTG GCG GAG GAG GAC GAG CGG 
AAG CTA GGC GAA CAC CGC CTC CTC CTG CTC GCC 

Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys
6799 GAG ATC TCC GTA CCC GCA GAA ATC CTG CGG AAG 
CTC TAG AGG CAT GGG CGT CTT TAG GAC GCC TTC 



Ser Arg Arg Phe Ala Gln Ala Leu Pro Val Trp
6832 TCT CGG AGA TTC GCC CAG GCC CTG CCC GTT TGG
AGA GCC TCT AAG CGG GTC CGG GAC GGG CAA ACC



Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu
6865 GCG CGG CCG GAC TAT AAC CCC CCG CTA GTG GAG
CGC GCC GGC CTG ATA TTG GGG GGC GAT CAC CTC



Thr Trp Lys Lys Pro Asp Tyr
6898 ACG TGG AAA AAG CCC GAC TAC
TGC ACC TTT TTC GGG CTG ATG
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Val His Gly Cys Pro Leu Pro Pro Pro Lys Ser
6931 GTC CAT GGC TGT CCG CTT CCA CCT CCA AAG TCC
CAG GTA CCG ACA GGC GAA GGT GGA GGT TTC AGG

Pro Pro Val Pro Pro Pro Arg Lys Lys Arg Thr 
6964 CCT CCT GTG CCT CCG CCT CGG AAG AAG CGG ACG
GGA GGA CAC GGA GGC GGA GCC TTC TTC GCC TGC

Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala
6997 GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC
CAC CAG GAG TGA CTT AGT TGG GAT AGA TGA CGG

Leu Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser 
7030 TTG GCC GAG CTC GCC ACC AGA AGC TTT GGC AGC 
AAC CGG CTC GAG CGG TGG TCT TCG AAA CCG TCG 

Ser Ser Thr Ser Gly Ile Thr Gly Asp Asn Thr
7063 TCC TCA ACT TCC GGC ATT ACG GGC GAC AAT ACG
AGG AGT TGA AGG CCG TAA TGC CCG CTG TTA TGC

Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys 
7096 ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC TGC 
TGT TGT AGG AGA CTC GGG CGG GGA AGA CCG ACG

Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser
7129 CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC
GGG GGG CTG AGG CTG CGA CTC AGG ATA AGG AGG

Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro 
7162 ATG CCC CCC CTG GAG GGG GAG CCT GGG GAT CCG 
TAC GGG GGG GAC CTC CCC CTC GGA CCC CTA GGC 

Asp Leu Ser Asp Gly Ser Trp Ser Thr Val Ser
7195 GAT CTT AGC GAC GGG TCA TGG TCA ACG GTC AGT 
CTA GAA TCG CTG CCC AGT ACC AGT TGC CAG TCA 
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Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys 
7228 AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC 
TCA CTC CGG TTG CGC CTC CTA CAG CAC ACG ACG 

Ser Met Ser Tyr Ser Trp Thr Gly Ala Leu Val 
7261 TCA ATG TCT TAC TCT TGG ACA GGC GCA CTC GTC 
AGT TAC AGA ATG AGA ACC TGT CCG CGT GAG CAG 



Thr Pro Cys Ala Ala Glu Glu Gln Lys Leu Pro
7294 ACC CCG TGC GCC GCG GAA GAA CAG AAA CTC CCC
TGG GGC ACG CGG CGC CTT CTT GTC TTT GAC GGG



Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His
7327 ATC AAT GCA CTA AGC AAC TCG TTG CTA CGT CAC 
TAG TTA CGT GAT TCG TTG AGC AAC GAT GCA GTG 



His Asn Leu Val Tyr Ser Thr Thr Ser Arg Ser 
7360 CAC AAT TTG GTG TAT TCC ACC ACC TCA CGC AGT 
GTG TTA AAC CAC ATA AGG TGG TGG AGT GCG TCA 



Ala Cys Gln Arg Gln Lys Lys Val Thr Phe Asp
7393 GCT TGC CAA AGG CAG AAG AAA GTC ACA TTT GAC 
CGA ACG GTT TCC GTC TTC TTT CAG TGT AAA CTG 



Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp
7426 AGA CTG CAA GTT CTG GAC AGC CAT TAC CAG GAC 
TCT GAC GTT CAA GAC CTG TCG GTA ATG GTC CTG 



Val Leu Lys Glu Val Lys Ala Ala Ala Ser Lys 
7459 GTA CTC AAG GAG GTT AAA GCA GCG GCG TCA AAA 
CAT GAG TTC CTC CAA TTT CGT CGC CGC AGT TTT 



Val Lys Ala Asn Leu Leu Ser Val Glu Glu Ala 
7492 GTG AAG GCT AAC TTG CTA TCC GTA GAG GAA GCT 
CAC TTC CGA TTG AAC GAT AGG CAT CTC CTT CGA 
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Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser
7525 TGC AGC CTG ACG CCC CCA CAC TCA GCC AAA TCC
ACG TCG GAC TGC GGG GGT GTG AGT CGG TTT AGG 

Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys
7558 AAG TTT GGT TAT GGG GCA AAA GAC GTC CGT TGC 
TTC AAA CCA ATA CCC CGT TTT CTG CAG GCA ACG 



His Ala Arg Lys Ala Val Thr His Ile Asn Ser
7591 CAT GCC AGA AAG GCC GTA ACC CAC ATC AAC TCC 
GTA CGG TCT TTC CGG CAT TGG GTG TAG TTG AGG 

Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr 
7624 GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA ACA 
CAC ACC TTT CTG GAA GAC CTT CTG TTA CAT TGT 



Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu
7657 CCA ATA GAC ACT ACC ATC ATG GCT AAG AAC GAG
GGT TAT CTG TGA TGG TAG TAC CGA TTC TTG CTC



Val Phe Cys Val Gln Pro Glu Lys Gly Gly Arg
7690 GTT TTC TGC GTT CAG CCT GAG AAG GGG GGT CGT 
CAA AAG ACG CAA GTC GGA CTC TTC CCC CCA GCA 

Lys Pro Ala Arg Leu Ile Val Phe Pro Asp Leu
7723 AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG 
TTC GGT CGA GCA GAG TAG CAC AAG GGG CTA GAC 

Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr 
7756 GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC 
CCG CAC GCG CAC ACG CTT TTC TAC CGA AAC ATG 



Asp Val Val Thr Lys Leu Pro
7789 GAC GTG GTT ACA AAG CTC CCC
CTG CAC CAA TGT TTC GAG GGG
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Leu Ala Val Met 
TTG GCC GTG ATG 
AAC CGG CAC TAC 
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Gly Ser Ser Tyr Gly Phe Gln Tyr Ser Pro Gly
7822 GGA AGC TCC TAC GGA TTC CAA TAC TCA CCA GGA
CCT TCG AGG ATG CCT AAG GTT ATG AGT GGT CCT



Gln Arg Val Glu Phe Leu Val Gln Ala Trp Lys
7855 CAG CGG GTT GAA TTC CTC GTG CAA GCG TGG AAG
GTC GGC CAA GTT AAG GAG GAG GTT CGG AGC TTC 



Ser Lys Lys Thr Pro Met Gly Phe Ser Tyr Asp 
7888 TCC AAG AAA ACC CCA ATG GGG TTC TCG TAT GAT
AGG TTC TTT TGG GGT TAG CGG AAG AGG ATA CTA 

Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Ser
7921 ACC CGC TGC TTT GAC TCC ACA GTC ACT GAG AGC
TGG GCG ACG AAA CTG AGG TGT CAG TGA CTC TCG



Asp Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys
7954 GAC ATC CGT ACC GAG GAG GCA ATC TAC CAA TGT
CTG TAG GCA TGG CTC CTC CGT TAG ATG GTT ACA

Cys Asp Leu Asp Pro Gln Ala Arg Val Ala Ile
7987 TGT GAG CTC GAG CGG GAA GGC CGG GTG GCG ATG 
ACA GTG GAG GTG GGG GTT GGG GCG GAG CGG TAG 

Lys Ser Leu Thr Glu Arg Leu Tyr Val Gly Gly 
8020 AAG TCC CTC ACC GAG AGG CTT TAT GTT GGG GGG
TTC AGG GAG TGG CTC TCG GAA ATA CAA CGG GCG 



Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly
8053 CCT CTT ACC AAT TCA AGG GGG GAG AAC TGC GGG
GGA GAA TGG TTA AGT TCG GCG CTC TTG ACG GCG



Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr
8086 TAT CGC AGG TGC CGG GCG AGC GGC GTA GTG ACA
ATA GCG TCC ACG GCG CGG TCG CGG CAT GAG TGT

FIG. 12-28
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Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Ile
8119 ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC 
TGA TCG ACA CCA TTG TGG GAG TGA ACG ATG TAG 

Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu 
8152 AAG GCC CGG GCA GCC TGT CGA GCC GCA GGG CTC 
TTC CGG GCC CGT CGG ACA GCT CGG CGT CCC GAG 

Gln Asp Cys Thr Met Leu Val Cys Gly Asp Asp
8185 CAG GAC TGC ACC ATG CTC GTG TGT GGC GAC GAC 
GTC CTG ACG TGG TAC GAG CAC ACA CCG CTG CTG 

Leu Val Val Ile Cys Glu Ser Ala Gly Val Gln
8218 TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC CAG
AAT CAG CAA TAG ACA CTT TCG CGC CCC CAG GTC

Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr Glu 
8251 GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG 
CTC CTG CGC CGC TCG GAC TCT CGG AAG TGC CTC 

Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp 
8284 GCT ATG ACC AGG TAC TCC GCC CCC CCT GGG GAC 
CGA TAC TGG TCC ATG AGG CGG GGG GGA CCC CTG 

Pro Pro Gln Pro Glu Tyr Asp Leu Glu Leu Ile
8317 CCC CCA CAA CCA GAA TAC GAC TTG GAG CTC ATA 
GGG GGT GTT GGT CTT ATG CTG AAC CTC GAG TAT 

Thr Ser Cys Ser Ser Asn Val Ser Val Ala His 
8350 ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC 
TGT AGT ACG AGG AGG TTG CAC AGT CAG CGG GTG 

Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr 
8383 GAC GGC GCT GGA AAG AGG GTC TAC TAC CTC ACC 
CTG CCG CGA CCT TTC TCC CAG ATG ATG GAG TGG 

FIG. 12-29 
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Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala
8416 CGT GAC CCT ACA ACC CCC CTC GCG AGA GCT GCG
GCA CTG GGA TGT TGG GGG GAG CGC TCT CGA CGC

Trp Glu Thr Ala Arg His Thr Pro Val Asn Ser 
8449 TGG GAG ACA GCA AGA CAC ACT CCA GTC AAT TCC 
ACC CTC TGT CGT TCT GTG TGA GGT CAG TTA AGG 

Trp Leu Gly Asn Ile Ile Met Phe Ala Pro Thr
8482 TGG CTA GGC AAC ATA ATC ATG TTT GCC CCC ACA 
ACC GAT CCG TTG TAT TAG TAC AAA CGG GGG TGT 

Leu Trp Ala Arg Met Ile Leu Met Thr His Phe
8515 CTG TGG GCG AGG ATG ATA CTG ATG ACC CAT TTC
GAC ACC CGC TCC TAC TAT GAC TAC TGG GTA AAG 

Phe Ser Val Leu Ile Ala Arg Asp Gln Leu Glu
8548 TTT AGC GTC CTT ATA GCC AGG GAC CAG CTT GAA 

Gln Ala Leu Asp Cys Glu Ile Tyr Gly Ala Cys
8581 CAG GCC CTC GAT TGC GAG ATC TAC GGG GCC TGC
GTC CGG GAG CTA ACG CTC TAG ATG CCC CGG ACG

Tyr Ser Ile Glu Pro Leu Asp Leu Pro Pro Ile
8614 TAC TCC ATA GAA CCA CTT GAT CTA CCT CCA ATC
ATG AGG TAT CTT GGT GAA CTA GAT GGA GGT TAG 

Ile Gln Arg Leu His Gly Leu Ser Ala Phe Ser
Rfid7 ATT CA^ AgI CTC CAT GGC CTC AGC GCA TTT TCA 
tH G^ TCT GAG GTA CCG GAG TCG CGT AAA AGT 



Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg
8680 CTC CAC AGT TAC TCT CCA GGT GAA ATT AAT AGG
GAG GTG TCA ATG AGA GGT CCA CTT TAA TTA TCC

FIG. 12-30
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Val Ala Ala Cys Leu Arg Lys Leu Gly Val Pro
8713 GTG GCC GCA TGC CTC AGA AAA CTT GGG GTA CCG
CAC CGG CGT ACG GAG TCT TTT GAA CCC CAT GGC

Pro Leu Arg Ala Trp Arg His Arg Ala Arg Ser
8746 CCC TTG CGA GCT TGG AGA CAC CGG GCC CGG AGC
GGG AAC GCT CGA ACC TCT GTG GCC CGG GCC TCG

Val Arg Ala Arg Leu Leu Ala Arg Gly Gly Arg
8779 GTC CGC GCT AGG CTT CTG GCC AGA GGA GGC AGG
CAG GCG CGA TCC GAA GAC CGG TCT CCT CCG TCC

Ala Ala Ile Cys Gly Lys Tyr Leu Phe Asn Trp
8812 GCT GCC ATA TGT GGC AAG TAC CTC TTC AAC TGG
CGA CGG TAT ACA CCG TTC ATG GAG AAG TTG ACC

Ala Val Arg Thr Lys Leu Lys
8845 GCA GTA AGA ACA AAG CTC AAA C
CGT CAT TCT TGT TTC GAG TTT G



FIG. 12-31 
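Each full row of FIG. 12 can be read as three lines: the three-letter amino acids, the coding strand prefixed by the position of its first nucleotide, and the base-paired complementary strand written in the same left-to-right order. The following is a minimal Python sketch of a consistency check for one such row, assuming the standard genetic code. The helper names `complement` and `translate` are illustrative and are not part of the original disclosure.

```python
# Illustrative helpers (not from the patent): check one row of FIG. 12.
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",  # stop codon
}


def complement(strand: str) -> str:
    """Return the base-paired strand in the same left-to-right order."""
    return strand.translate(str.maketrans("ACGT", "TGCA"))


def translate(strand: str) -> str:
    """Translate space-separated codons into three-letter amino acids."""
    return " ".join(THREE_LETTER[CODON_TABLE[codon]] for codon in strand.split())


# Coding strand of the row that starts at position 5248 in FIG. 12.
coding = "GTC CAG ACC AAC TGG CAA AAA CTC GAG ACC TTC"
print(translate(coding))                      # amino acid line of the row
print(complement(coding))                     # third line of the row
print(5248 + len(coding.replace(" ", "")))    # start position of the next row
```

A row whose coding line translates to a stop codon (`***`) in the middle of the reading frame, or whose third line is not the complement of the second, most likely contains a character-recognition error rather than a real sequence feature.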
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primer J159S 
jl ACTGCCCTf^AACTGCAATGA 

PT G 



Ser Leu Lys Thr Gly Phe Leu Ala Ala 
C TCC CTC AAA ACT GGG TTT CTT GCC GCG 
TAG CCCGGTGAG 
Asn Trp Gly 



Leu Phe Tyr Thr His Lys Phe Asn Ala 
29 CTG TTC TAC ACA CAC AAG TTC AAC GCG

T T CAC T T 

His ser 
primer 166A for J1-1216

Ser Gly Cys Pro Glu Arg Met Ala Ser 
56 TCC GGA TGC CCG GAG CGC ATG GCC AGC 
ACTT AGCA 

Leu 



Cys Arg Ser Ile Asp Lys Phe Asp Gln
83 TGT CGC TCC ATT GAC AAG TTC GAC CAG 
C AC C ACGT T 
Pro Leu Thr Asp 



Gly Trp Gly Pro Ile Thr Tyr Ala Gln
110 GGA TGG GGT CCC ATC ACC TAT GCT CAA 
C C T GT CAC 

Ser Asn 



Pro Asp Asn Ser Asp Gln Arg Pro Tyr
137 CCT GAC AAC TCG GAC CAG AGG CCG TAT 
GGA AG GG C C C C C C 

Gly Ser Glv Pro 

Cys Trp His Tyr Ala Pro Arg Gln Cys
164 TGC TGG CAC TAC GCA CCT CGA CAG TGT 

C C A AA CT C 
Pro Lys pro 



FIG. 13-1 
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Jl 
PT 



Jl 
PT 



Gly Ile Val Pro Ala Ser Gln Val Cys
191 GGT ATC GTA CCC GCG TCG CAG GTG TGC 
T G AA AGT T 

Lvs Ser 

Gly Pro Val Tyr Cys Phe Thr Pro Ser 
218 GGT CCA GTG TAT TGC TTC ACC CCA AGC 
G A T C 



Jl 
PT 



Pro val Val Val Gly Thr Thr Asp Arg 
245 CCT GTT GTA GTG GGG ACG ACC GAT CGT 
C G G A CAG 



Jl 
PT 



Phe Gly Ala Pro Thr Tyr Asn Trp Gly 
272 TTC GGC GCC CCT ACG TAT AAC TGG GGG 
CG G C C C G T 

Ser Ser 



Jl 
PT 



Asp Asn Glu Thr Asp Val Leu Leu Leu 
299 GAC AAT GAG ACG GAC GTG CTG CTC CTA 

T C T C G T 

Glu Asp Phe Val 



Jl 
PT 



Asn Asn Thr Arg Pro Pro His Gly Asn 
326 AAC AAC ACG CGG CCC CCG CAC GGC AAC 
T . C A A TG T 

Leu 



Trp Phe Gly Cys Thr 

PGGi 

primer 199A 



Ti 353 TGG TTC GGC TGT ACA 

Jl JDJ luv, ^ rTr,r,ATGAACT rAACTGGATT 



Nucleotide Match: 259/367 (70.6%) 

Amino Acid Match (stringent) : 93/122 (76.2%) 

(relaxed): 111/122 (91.0%) 

FIG. 13-2 






EP 0 419 182 A1 



Prototype HCV (PT) sequences different from 
Japanese HCV (J1) are shown. 

Relaxed amino acid match: Gly=Ala=Pro=Ser=Thr, 
Asp=Glu, Asn=Gln, 
Arg=Lys=His, Leu=Ile=Val=Met, Phe=Trp=Tyr. 
Underline: amino acid that differs even under relaxed 
matching. 
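The stringent and relaxed match figures quoted for FIG. 13 can be illustrated with a short Python sketch. It assumes that a stringent match means identical residues and that a relaxed match also accepts two residues from the same group in the legend; the names `RELAXED_GROUPS`, `match_counts` and `percent` are hypothetical, and the nine-residue example is a single row of the J1 versus HCV-1 comparison, not the full 122-residue alignment.

```python
# Equivalence groups taken from the figure legend (relaxed matching).
RELAXED_GROUPS = [
    {"Gly", "Ala", "Pro", "Ser", "Thr"},
    {"Asp", "Glu"},
    {"Asn", "Gln"},
    {"Arg", "Lys", "His"},
    {"Leu", "Ile", "Val", "Met"},
    {"Phe", "Trp", "Tyr"},
]
GROUP_OF = {res: idx for idx, group in enumerate(RELAXED_GROUPS) for res in group}


def match_counts(seq_a, seq_b):
    """Return (stringent, relaxed, length) for two aligned residue lists."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    stringent = sum(a == b for a, b in zip(seq_a, seq_b))
    relaxed = sum(
        a == b or (a in GROUP_OF and GROUP_OF[a] == GROUP_OF.get(b))
        for a, b in zip(seq_a, seq_b)
    )
    return stringent, relaxed, len(seq_a)


def percent(matches: int, total: int) -> float:
    """Percentage rounded to one decimal place, as quoted in the figure."""
    return round(100.0 * matches / total, 1)


# Illustrative nine-residue stretch: Ile/Val is a relaxed match, Glu/Gln is not.
j1 = "Thr Ile Pro Ala Ser Ala Tyr Glu Val".split()
hcv1 = "Thr Val Pro Ala Ser Ala Tyr Gln Val".split()
print(match_counts(j1, hcv1))

# The totals quoted in the figure reduce to the stated percentages.
print(percent(259, 367), percent(93, 122), percent(111, 122))
```

Residues outside every group, such as Cys, count as matching only when identical.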
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Core to NS1 vs. HCV-1 

Pro Leu Val 

g^.l ^"^^ ^'^^ 

A ~ — — 



      Gly Ala Pro Leu Gly Gly Ala Ala Arg
J1 11 GGC GCC CCC TTA GGG GGC GCT GCC AGG
              --T C-T --A

      Ala Leu Ala His Gly Val Arg Val Leu
J1 38 GCC CTG GCA CAT GGT GTC CGG GTT CTG

      Glu Asp Gly Val Asn Tyr Ala Thr Gly
J1 65 GAG GAC GGC GTG AAC TAT GCA ACA GGG

      Asn Leu Pro Gly Cys Ser Phe Ser Ile
J1 92 AAT TTG CCC GGT TGC TCT TTC TCT ATC

       Phe Leu Leu Ala Leu Leu Ser Cys Leu
J1 119 TTC CTC TTG GCT CTG CTG TCC TGT TTG


Thr Ile Pro Ala Ser Ala Tyr Glu Val 
J1 146 ACC ATC CCA GCT TCC GCT TAT GAA GTG 

— T G-G — C G — C — C C 

Val Gln 

Arg Asn Val Ser Gly Ile Tyr His Val 
J1 173 CGC AAC GTG TCC GGG ATA TAC CAT GTC 

TCC A-G C-T C - — 

Ser Thr Leu 



Thr Asn Asp Cys Ser Asn Ser Ser Ile 
J1 200 ACA AAC GAC TGC TCC AAC TCA AGC ATT 

««C — T — T C-T G — T 

Pro 



FIG. 14-1 
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Val Tyr Glu Ala Ala Asp Val Ile Met 
227 GTG TAT GAG GCG GCG GAG GTG ATC ATG 

C C — T -CC C — 

Ala Leu 



His Ala Pro Gly Cys Val Pro Cys Val 
254 CAT GCC CCC GGG TGC GTG CCC TGC GTT 

— C A-T — G C — T 

Thr 



Arg Glu Asn Asn Ser Ser Arg Cys Trp 
281 CGG GAG AAC AAT TCC TCC CGT TGC TGG 
T GG- — C G — — G A-G — T 

Gly Ala 



Val Ala Leu Thr Pro Thr Leu Ala Ala 
308 GTA GCG CTC ACT CCC ACG CTC GCG GCC 

A A-G — C — T G-G — C A — 

Met val Thr 



Arg Asn Ala Ser Val Pro Thr Thr Thr 
Jl 335 AGG AAT GCC AGC GTC CCC ACT ACG ACA 

G G- -AA C G-G CAG 

Asp Gly Lys Leu Ala 

Gin 



Leu Arg Arg His Val Asp Leu Leu Val 
362 TTA CGA CGC CAC GTC GAC TTG CTC GTT 

C_T T A T C T — C 

lie 



Gly Thr Ala Ala Phe cys Ser Ala Met 
389 GGG ACG GCT GCT TTC TGC TCC GCT ATG 

GC ~C A-C C T ~G — C C-C 

ser Thr Leu Leu 



Jl 



Tyr Val Gly Asp Leu Cys Gly Ser Val 
416 TAC GTG GGG GAT CTC TGC GGA TCT GTT 
Q A — G — C 

FIG. 14-2 
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Phe Leu Ile Ser Gln Leu Phe Thr Phe 
443 TTC CTC ATC TCC CAG CTG TTC ACC TTC 









Val Gly 








    Ser Pro Arg Arg His Glu Thr Val Gln
470 TCG CCT CGC CGG CAT GAG ACA GTA CAG
    --T --C A-G --C --C TG- --G ACG --A
                        Trp     Thr

    Asp Cys Asn Cys Ser Ile Tyr Pro Gly
497 GAC TGC AAC TGC TCA ATC TAT CCC GGC
    -GT
    Gly

    His Val Ser Gly His Arg Met Ala Trp
524 CAC GTA TCA GGC CAT CGC ATG GCT TGG
        Ile Thr

    Asp Met Met Met Asn Trp Ser Pro Thr
551 GAT ATG ATG ATG AAC TGG TCG CCC ACG


Ala Ala Leu Val Val Ser Gin Leu Leu 
578 GCA GCC TTA GTG GTG TCG CAG TTA CTC 

A-G — G — G —A A— G-T — - C-G 

Thr Met Ala 

Arg Ile Pro Gln Ala Val Met Asp Met 
605 CGG ATC CCA CAA GCT GTC ATG GAC ATG 

C A— T 

lie Leu 



Val Ala Gly Ala His Trp Gly Val Leu 
632 GTG GCG GGG GCC CAC TGG GGA GTC CTA 

;^-C — T — T — T — G 

He 

FIG. 14-3 
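In FIG. 14 the HCV-1 line under each J1 row appears to list only the nucleotides that differ, with a dash wherever HCV-1 is identical to J1. Under that reading, which is an assumption about the notation rather than something the figure states, the HCV-1 codons can be recovered from the J1 row as in the following Python sketch. The function name `apply_differences` is hypothetical.

```python
def apply_differences(reference_codons, difference_codons):
    """Expand a dash-masked difference line against its reference row.

    A '-' in the difference codon means "same nucleotide as the reference";
    any other character replaces the reference nucleotide at that position.
    """
    if len(reference_codons) != len(difference_codons):
        raise ValueError("rows must contain the same number of codons")
    expanded = []
    for ref, diff in zip(reference_codons, difference_codons):
        if len(ref) != len(diff):
            raise ValueError(f"codon length mismatch: {ref!r} vs {diff!r}")
        expanded.append("".join(r if d == "-" else d for r, d in zip(ref, diff)))
    return expanded


# Four J1 codons from the row starting at position 470 (Arg Arg His Glu)
# and the difference codons read from the line beneath them.
j1_codons = "CGC CGG CAT GAG".split()
hcv1_diff = "A-G --C --C TG-".split()
print(" ".join(apply_differences(j1_codons, hcv1_diff)))
```

The last expanded codon is TGG, which encodes Trp, in agreement with the "Trp" printed beneath Glu in that row; the first three changes are silent.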
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Ala Gly Leu Ala 
659 GCG GGC CTT GCC 

A-A — G 

He 



Tyr Tyr Ser Met Val 
TAC TAT TCC ATG GTG 

— T -TC 

Phe 



Gly Asn Trp Ala Lys Val Leu Ile Val 
686 GGG AAC TGG GCT AAG GTT TTG ATT GTG 

G C C — G-A 

Val 



Met Leu Leu Phe Ala Gly Val Asp Gly 
713 ATG CTA CTC TTT GCC GGC GTT GAC GGG 

C G —A C C- 

Leu Ala 



His Thr Arg Val Thr Gly Gly Val Gin 

740 CAT ACC CGC GTG ACG GGG GGG GTG CAA 

A C — C A AGT GCC 

Glu His Ser Ala 



Gly His Val Thr 
767 GGC CAC GTC ACC 

ACT GTG 

Thr Val 



Leu Phe Arg Pro 
794 CTC TTT AGA CCT 

C-C GC A 

Leu Ala 



Ser Thr Leu Thr Ser 
TCT ACA CTC ACG TCC 

GGA T-T GTT AG- 

Gly Phe Val 



Gly Ala Ser Gin Lys 
GGG GCG TCC CAG AAA 

— C — C AAG C C 

Lys Asn 



Ile Gln Leu Val Asn Thr Asn Gly Ser 
821 ATT CAG CTT GTA AAC ACC AAT GGC AGT 

G_C G A-C C 

Val He 



Trp His Ile Asn Arg Thr Ala Leu Asn 
848 TGG CAT ATC AAC AGG ACT GCC CTG AAC 
C C T — C ~G 

Ser 



FIG. 14-4 
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Cys Asn Asp Ser Leu Gln Thr Gly Phe 
875 TGC AAT GAC TCC CTC CAA ACT GGG TTC 

T AG — A-C — C — C -GG 

Asn Trp 



902 



Leu 


Ala 


Ala 














CTT 


GCC 


GCG 


CTG 


TTC 


TAC 


ACA 


CAC 


AAG 


T-G 












CAC 


C — 








Gly 








His 






TTC 


AAC 


GCG 


TCC 


GGA 


TGC 


CCG 


GAG 


CGC 
















A-G 



929 

Ser 



Ser lie Asp Lys 
ATG GCC AGC TGT CGC TCC ATT GAC AAG 

956 C-A C ~A C— C— AC- G-T 

Leu Pro Leu Thr Asp 



Phe Asp Gln Gly Trp Gly Pro Ile Thr 
983 TTC GAC CAG GGA TGG GGT CCC ATC ACC 

T c C — T GT 

Ser 



Tyr Ala Gln Pro Asp Asn Ser Asp Gln 
1010 TAT GCT CAA CCT GAC AAC TCG GAC CAG 

C AAC GGA AGC GG- C-C 

Asn Gly Ser Gly Pro 



Arg Pro Tyr Cys Trp His Tyr Ala Pro 
1037 AGG CCG TAT TGC TGG CAC TAC GCA CCT 

C-C — C — C C-C A 

Pro 



Arg Gln Cys Gly Ile Val Pro Ala Ser 
1064 CGA CAG TGT GGT ATC GTA CCC GCG TCG 

CT — C T — G AA- 

Lys Pro Lys 



FIG. 14-5 
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Gin Val Cys Gly Pro Val Tyr Cys Phe 
1091 CAG GTG TGC GGT CCA GTG TAT TGC TTC 

ACT - — ~T — G — A 

Ser 



Thr Pro ser Pro Val Val Val Gly Thr 
1118 ACC CCA AGC CCT GTT GTA GTG GGG ACG 
T — C - — — C — G — G ~A 



Thr Asp Arg Phe Gly Ala Pro Thr Tyr 
1145 ACC GAT CGT TTC GGC GCC CCT ACG TAT 

A-G -CG G — C — C — e 

Ser 



Asn Trp Gly Asp Asn Glu Thr Asp Val 
1172 AAC TGG GGG GAC AAT GAG ACG GAC GTG 

.Q T —A T C 

Ser Glu Asp 



Leu Leu Leu Asn Asn Thr Arg Pro Pro 
1199 CTG CTC CTA AAC AAC ACG CGG CCC CCG 

T-C G — — T T — C A A 

Phe Val 



His Gly Asn Trp Phe Gly Cys Thr 
1226 CAC GGC AAC TGG TTC GGC TGT ACA 
.TG T T 

FIG. 14-6 
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Gly Asn Trp Phe Gly Cys Thr Trp Met 
J1 1 TG GGC AAC TGG TTC GGC TGT ACA TGG ATG 
HCV-1 T T ^ 

Asn Ser Thr Gly Phe Thr Lys Thr Cys 
J1 30 AAT AGC ACT GGG TTC ACC AAG ACG TGC 
HCV-1 — C TCA A A GT- 
Val 

Gly Gly Pro Pro Cys Asn Ile Gly Gly 
J1 57 GGA GGC CCC CCG TGT AAC ATC GGG GGG 
HCV-1 CG — T — T GT A 
Val 

Val Gly Asn Asn Thr Leu Thr Cys Pro 
J1 84 GTC GGC AAC AAC ACC TTG ACC TGC CCC 
HCV-1 -CG 
Ala His 

Thr Asp Cys Phe Arg Lys Thr Pro Thr 
J1 111 ACG GAC TGC TTC CGG AAG ACC CCG ACG 
HCV-1 !: m c CAT GAC 
His Asp 

Ala Thr Tyr Thr Lys Cys Gly Ser Gly 
J1 138 GCC ACT TAC ACA AAA TGT GGT TCG GGC 
HCV-1 A T-T CGG — C — C — C — T 
Ser Arg 

Pro Trp Leu Thr Pro Arg Cys Leu Val 
J1 165 CCT TGG TTG ACA CCT AGG TGC TTG GTT 
HCV-1 ..C A-C C C— — C 
Ile 

Asp Tyr Pro Tyr Arg Leu Trp His Tyr 
J1 192 GAC TAC CCA TAC AGG CTC TGG CAC TAC 
HCV-1 Q rp T — T — T 

FIG. 15-1 
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Pro Cys Thr Val Asn Phe Thr lie Phe 
CCC TGC ACT GTC AAC TTT ACC ATC TTC 

— rp — T — C A — -AC — A — T 

He Tyr 



Lys Val Arg Met Tyr Val Gly Gly Val 
AAG GTT AGG ATG TAT GTG GGG GGC GTG 

~A A-C C A — G — C 

He 



Glu His 
GAG CAC 

FIG. 15-2 
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C200 region sequence vs. HCV-1 

Asn Met Ser 
3799 AAT ATG TCC 

HCV-1 3781 ACA CTG GGC TTT GGT GCT T-C 

Thr Leu Gly Phe Gly Ala Tyr 

Lys Ala His Gly Thr Asp Pro Asn Ile 
C200 3808 AAG GCA CAT GGC ACC GAC CCC AAC ATC 

HCV-1 T G -T T — T 

lie 



Arg Thr Gly Val Arg Thr Ile Thr Thr 
C200 3835 AGA ACT GGG GTA AGG ACC ATC ACC ACA 
HCV-1 — G — C G —A —A — T T 

Gly Ala Pro Ile Thr Tyr Ser Thr Tyr 
C200 3862 GGT GCC CCC ATT ACG TAC TCC ACC TAT 

HCV-1 — C AG C C 

Ser 



Arg Lys Phe Leu Ala Asp Gly Gly Cys 
C200 3889 CGC AAG TTC CTT GCC GAC GGT GGT TGC 
HCV-1 G ^ 

Gly 

Ser Gly Gly Ala Tyr Asp Ile Ile 
C200 3916 TCC GGG GGC GCC TAT GAC ATC ATA A 
HCV-1 -G — T A ~ 



HCV-1 3943 TGT GAC GAG TGC CAC TCC ACG GAT GCC 
Cys Asp Glu Cys His Ser Thr Asp Ala 

HCV-1 3970 ACA TCC ATC TTG GGC ATC GGC ACT GTC 
Thr Ser Ile Leu Gly Ile Gly Thr Val 

FIG. 16-1 
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HCV-1 3997 CTT GAC CAA GCA GAG ACT GCG GGG GCG 
Leu Asp Gln Ala Glu Thr Ala Gly Ala 

HCV-1 4024 AGA CTG GTT GTG CTC GCC ACC GCC ACC 
Arg Leu Val Val Leu Ala Thr Ala Thr 

HCV-1 4051 CCT CCG GGC TCC GTC ACT GTG CCC CAT 
Pro Pro Gly Ser Val Thr Val Pro His 

HCV-1 4078 CCC AAC ATC GAG GAG GTT GCT CTG TCC 
Pro Asn Ile Glu Glu Val Ala Leu Ser 

HCV-1 4105 ACC ACC GGA GAG ATC CCT TTT TAC GGC 
Thr Thr Gly Glu Ile Pro Phe Tyr Gly 

Ser Ile Pro Ile Glu Ala Ile Lys 
C200 4132 A AGC ATC CCC ATC GAG GCC ATC AAG 

HCV-1 AAG GCT C A -TA - 

Lys Ala 



Gly Gly Arg His Leu Ile Phe Cys His 
C200 4159 GGG GGA AGG CAT CTC ATC TTC TGC CAT 
HCV-1 G 



Ser Lys Lys Lys Cys Asp Glu Leu Ala 
C200 4186 TCC AAG AAG AAG TGT GAC GAG CTC GCC 
HCV-1 —A C A 

Ala Lys Leu Ser Ala Leu Gly Leu Asn 
C200 4213 GCA AAG CTG TCA GCC CTC GGA CTC AAT 
HCV-1 - GTC -A T-G -C A--- — 

Val 



Ala Val Ala Tyr Tyr Arg Gly Leu Asp 
C200 4240 GCC GTG GCG TAT TAC CGC GGT CTT GAT 
HCV-1 C — C C 



FIG. 16-2 
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Val Ser Val Ile Pro Thr Ser Gly Asp 
C200 4267 GTG TCC GTC ATA CCA ACT AGC GGA GAC 
HCV-1 C — G — C C ~T 



Val Val Val Val Ala Thr Asp 
C200 4294 GTC GTT GTC GTG GCA ACA GAC GC^4316 
HCV-1 



.fji Q — C — T — C CTC 



HCV-1 4321 ATG ACC GGC TAT ACC GGC GAC TTC GAC 
Met Thr Gly Tyr Thr Gly Asp Phe Asp 

HCV-1 4348 TCG GTG ATA GAC TGC AAT ACG TGT GTC 
Ser Val Ile Asp Cys Asn Thr Cys Val 

HCV-1 4375 ACC CAG ACA GTC GAT TTC AGC CTT GAC 
Thr Gln Thr Val Asp Phe Ser Leu Asp 

HCV-1 4402 CCT ACC TTC ACC ATT GAG ACA ATC ACG 
Pro Thr Phe Thr Ile Glu Thr Ile Thr 

HCV-1 4429 CTC CCC CAG GAT GCT GTC TCC CGC ACT 
Leu Pro Gln Asp Ala Val Ser Arg Thr 

HCV-1 4456 CAA CGT CGG GGC AGG ACT GGC AGG GGG 
Gln Arg Arg Gly Arg Thr Gly Arg Gly 

HCV-1 4483 AAG CCA GGC ATC TAC AGA TTT GTG GCA 
Lys Pro Gly Ile Tyr Arg Phe Val Ala 

HCV-1 4510 CCG GGG GAG CGC CCC TCC GGC ATG TTC 
Pro Gly Glu Arg Pro Ser Gly Met Phe 

HCV-1 4537 GAC TCG TCC GTC CTC TGT GAG TGC TAT 
Asp Ser Ser Val Leu Cys Glu Cys Tyr 

FIG. 16-3 
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HCV-1 4564 GAC GCA GGC TGT GCT TGG TAT GAG CTC 
Asp Ala Gly Cys Ala Trp Tyr Glu Leu 

HCV-1 4591 ACG CCC GCC GAG ACT ACA GTT AGG CTA 
Thr Pro Ala Glu Thr Thr Val Arg Leu 

HCV-1 4618 CGA GCG TAC ATG AAC ACC CCG GGG CTT 
Arg Ala Tyr Met Asn Thr Pro Gly Leu 

HCV-1 4645 CCC GTG TGC CAG GAC CAT CTT GAA TTT 
Pro Val Cys Gln Asp His Leu Glu Phe 

HCV-1 4672 TGG GAG GGC GTC TTT ACA GGC CTC ACT 
Trp Glu Gly Val Phe Thr Gly Leu Thr 

HCV-1 4699 CAT ATA GAT GCC CAC TTT CTA TCC CAG 
His Ile Asp Ala His Phe Leu Ser Gln 

HCV-1 4726 ACA AAG CAG AGT GGG GAG AAC CTT CCT 
Thr Lys Gln Ser Gly Glu Asn Leu Pro 

HCV-1 4753 TAC CTG GTA GCG TAC CAA GCC ACC GTG 
Tyr Leu Val Ala Tyr Gln Ala Thr Val 

HCV-1 4780 TGC GCT AGG GCT CAA GCC CCT CCC CCA 
Cys Ala Arg Ala Gln Ala Pro Pro Pro 

HCV-1 4807 TCG TGG GAC CAG ATG TGG AAG TGT TTG 
Ser Trp Asp Gln Met Trp Lys Cys Leu 

HCV-1 4834 ATT CGC CTC AAG CCC ACC CTC CAT GGG 
Ile Arg Leu Lys Pro Thr Leu His Gly 

HCV-1 4861 CCA ACA CCC CTG CTA TAC AGA CTG GGC 
Pro Thr Pro Leu Leu Tyr Arg Leu Gly 

FIG. 16-4 
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HCV-1 4888 GCT GTT CAG AAT GAA ATC ACC CTG ACG 
Ala Val Gln Asn Glu Ile Thr Leu Thr 

HCV-1 4915 CAC CCA GTC ACC AAA TAC ATC ATG ACA 
His Pro Val Thr Lys Tyr Ile Met Thr 

HCV-1 4942 TGC ATG TCG GCC GAC CTG GAG GTC GTC 
Cys Met Ser Ala Asp Leu Glu Val Val 

HCV-1 4969 ACG AGC ACC TGG GTG CTC GTT GGC GGC 
Thr Ser Thr Trp Val Leu Val Gly Gly 

HCV-1 4996 GTC CTG GCT GCT TTG GCC GCG TAT TGC 
Val Leu Ala Ala Leu Ala Ala Tyr Cys 

HCV-1 5023 CTG TCA ACA GGC TGC GTG GTC ATA GTG 
Leu Ser Thr Gly Cys Val Val Ile Val 

HCV-1 5050 GGC AGG GTC GTC TTG TCC GGG AAG CCG 
Gly Arg Val Val Leu Ser Gly Lys Pro 

Glu Val Leu 

C200 ^^"^ 

HCV-1 5077 GCA ATC ATA CCT GAC AGG 

Ala Ile Ile Pro Asp Arg 

Tyr Arg Glu Phe Asp Glu Met Glu Glu 
C200 5104 TAC CGA GAG TTC GAT GAG ATG GAA GAG 
HCV-1 ~~ 

Cys Ala Ser His Leu Pro Tyr Ile Glu 
C200 5131 TGC GCC TCA CAC CTC CCC TAC ATC GAA 

HCV-1 T-T CAG T-A — G G 

Ser Gin 

FIG. 16-5 
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Gln Gly Met Gln Leu Ala Glu Gln Phe 
C200 5158 CAG GGA ATG CAG CTC GCC GAG CAA TTC 

HCV-1 —A — G AT G 

Met 



Lys Gln Lys Ala Leu Gly Leu Leu Gln 
C200 5185 AAG CAG AAG GCG CTC GGG TTG CTG CAA 
HCV-1 C C C-C G 

Thr Ala Thr Lys Gln Ala Glu Ala Ala 
C200 5212 ACA GCC ACC AAG CAA GCG GAG GCT GCT 

C — G T— CGT — -G —A T- ATC 

Ser Arg Val Ile 



HCV-1 



Ala Pro Cys Glu Ser Met His Ala Ser 
C200 5239 GCT CCG TGT GAG TCA ATG CAC GCC TCG 

HCV-1 — C — T GC TC CAG -CC A— TGG CAA 

Ala Val Gln Thr Asn Trp Gln 



HCV-1 ^^^^ -AA CTC GAG ACC TTC TGG GCG AAG CAT 
Lys Leu Glu Thr Phe Trp Ala Lys His 

HCV-1 5293 ATG TGG AAC TTC ATC AGT GGG ATA CAA TA 
Met Trp Asn Phe Ile Ser Gly Ile Gln 

FIG. 16-6 
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HCV-1 



Jl 

HCV-1 



Jl 

HCV-1 
Ala 



Jl 

HCV-1 



HCV-1 



Jl 

HCV-1 



HCV-1 




NS1 Sequence vs. HCV-1 



Leu Gly Asn Trp Phe Gly Cys Thr Trp 
1 G TTG GGC AAT TGG TTC GGT TGC ACC TGG 



- C- 



Met Asn ser Ser Gly Phe Thr Lys Val 
29 ATG AAC TCA TCT GGA TTT ACC AAA GTG 

A C 

Thr 

cys Gly Ala Pro Pro Cys Val lie Gly 
56 TGC GGA GCG CCT CCT TGT GTC ATC GGA 



Glv Val Gly Asn Asn Thr Leu Gin Cys 
Jl 83 GGG GTG GGC AAC AAC ACC TTG CAA TGC 

Pro Thr Asp Cys Phe Arg Lys His Pro 
110 CCC ACT GAC TGT TTC CGC AAG CAT CCG 
T 



AsD Ala Thr Tyr Ser Arg Cys Gly Ser 
Jl 137 GAC GCC ACA TAC TCT CGG TGC GGT TCC 



Gly Pro Trp He Thr Pro Arg Cys Leu 
164 GGT CCC TGG ATT ACG CCC AGG TGC CTG 
— C — A — " 



Val His Tyr Pro Tyr Arg Leu Trp His 
Jl 191 GTC CAC TAC CCT TAT AGG CTT TGG CAT 



G G 

Asp 



Tvr Pro cys Thr Val Asn Tyr Thr Leu 
Jl 218 TAT CCC TGT ACT GTC AAC TAC ACC TTG 
HCV-1 T lie 

FIG. 17-1 
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Phe Lys Val Arg Met Tyr Val Gly Gly 
J1 245 TTC AAA GTC AGG ATG TAC GTG GGA GGG 
HCV-1 — T A — — 
Ile 

Val Glu His Arg Leu Glu Val Ala Cys 
J1 272 GTC GAG CAC AGG CTG GAA GTT GCT TGC 
HCV-1 A C C 
Ala 

Asn Trp Thr Arg Gly Glu Arg Cys Asp 
J1 299 AAC TGG ACG CGG GGC GAG CGT TGT GAT 
HCV-1 --A — C 



Leu Asp Asp Arg Asp 
Jl 326 CTG GAC GAC AGG GAC A 

HCV-1 A 

Glu 

FIG. 17-2 
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Core Sequence vs. HCV-1 

1 GCGTCTAGCCATGGCGTTAGTATGAGTGTC 

HCV-1 

31 GTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCC 

HCV-1 *" 

J1 66 ATAGTGGTCTGCGGAACCGGTGAGTACACCGGAAT 

HCV-1 

J1 101 TGCCAGGACGACCGGGTCCTTTCTTGGATCAACCC 

HCV-1 

Jl 136 GCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCGA 

HCV-1 ^~ 

Jl 171 GACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGC 

HCV-1 ' " 

Jl 206 CTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGC 

HCV-1 

Met Ser 

Jl 241 CCCGGGAGGTCTCGTAGACCGTGCATCATG AGC 



■C- 



Thr Asn Pro Lys Pro Gin Arg Lys Thr 
Jl 274 ACA AAT CCT AAA CCT Ckh AGA AAA ACC 

HCV-1 — G -A 

Lys Asn 

Lys Arg Asn Thr Asn Arg Arg Pro Gln 
Jl 301 AAA CGT AAC ACC AAC CGC CGC CCA CAG 
HCV-1 " 

Asp Val Lys Phe Pro Gly Gly Gly Gln 
Jl 328 GAC GTC AAG TTC CCG GGC GGT GGT CAG 

HCV-1 ^ 

Ile Val Gly Gly Val Tyr Leu Leu Pro 
J1 355 ATC GTT GGT GGA GTT TAC CTG TTG CCG 

HCV-1 

Arg Arg Gly Pro Arg Leu Gly Val Arg 
Jl 382 CGC AGG GGC CCC AGG TTG GGT GTG CGC 
HCV-1 T —A 

FIG. 18-1 
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Ala Thr Arg Lys Thr Ser Glu Arg Ser 
J1 409 GCG ACT AGG AAG ACT TCC GAG CGG TCG 
HCV-1 G — A 

Gln Pro Arg Gly Arg Arg Gln Pro Ile 
J1 436 CAA CCT CGT GGA AGG CGA CAA CCT ATC 
HCV-1 — A — T — A T-T — G 

Pro Lys Ala Arg Gln Pro Glu Gly Arg 
J1 463 CCC AAG GCT CGC CAG CCC GAG GGC AGG 
HCV-1 
Arg 

Ala Trp Ala Gln Pro Gly Tyr Pro Trp 
J1 490 GCC TGG GCT CAG CCC GGG TAC CCT TGG 
HCV-1 A 
Thr 

Pro Leu Tyr Gly Asn Glu Gly Met Gly 
J1 517 CCC CTC TAT GGC AAC GAG GGC ATG GGG 
HCV-1 — T TGC 
Cys 

Trp Ala Gly Trp Leu 
J1 544 TGG GCA GGA TGG CTC CT 
HCV-1 

FIG. 18-2 
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